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1. What is Galois theory? 

Originally, the equation + \ — had no solution. Then the two solutions i 
and —i were created. But there is absolutely no way to tell who is i and who is 
—i} That is Galois Theory. 

Thus, Galois Theory tells you how far we cannot distinguish between the roots 
of an equation. This is codified in the Galois Group. 

2. Galois groups 
More precisely, consider an equation 

F" + aiF"-! + • • • + a„ = 

and let ai, . . . ,q;„ be its roots, which are assumed to be distinct. By definition, 
the Galois Group G of this equation consists of those permutations of the roots 
which preserve all relations between them. Equivalcntly, G is the set of all those 
permutations a of the symbols {1, 2, ... , n} such that 4>{a„(i), . . . , ac(n)) = for 
every n- variable polynomial for which (/'(q!i, ...,«„) = 0. The coefficients of cf) 
are supposed to be in a field K which contains the coefficients ai, . . . ,a„ of the 
given polynomial 

/ = f{Y) = r" + aiF"-! + ■•• + «„. 

We call G the Galois Group of / over K and denote it by GalY(/, K) or Gal(/, K). 
This is Galois' original concrete definition. 

According to the modern abstract definition, the Galois Group of a normal ex- 
tension L of a field K is defined to be the group of all if-automorphisms of L and 
is denoted by Gal(L, AT). Note that a normal extension L of a field if is a field 
obtained by adjoining to K all the roots of a bunch of univariate polynomials with 
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coefE-cients in K. To relate the two definitions, let L = K{ai, . . . , an) and note that 
we get an isomorphism of Ga.l{L, K) onto Gal(/, K) by sending any r e Gal{L, K) 
to that a G Gal(/, K) for which T{ai) = 0:^(1) for 1 < i < n. 

3. Permutation groups 

The above concrete definition brings out the close connection between group 
theory and theory of equations. To wit, the Galois Group Gal(/, K) is now a 
subgroup of Sn where, as usual, Sn denotes the symmetric group of degree n, i.e., 
the group of all permutations of n symbols; note that the order of Sn is n\. Quite 
generally, a subgroup of Sn is called a (permutation) group of degree n. Here 
the use of the word degree is meant to remind us that potentially it comes from 
an equation of degree n. To convert this potentiality into actuality, in various 
situations, constitutes Inverse Galois Theory. To further bring out the parallelism 
between group theory and the theory of equations, we note that. 

(1) / is irreducible iff Gal(/, K) is transitive. 
Here, a permutation group G "acting"^ on the set O = {1, 2, . . . , n} is transitive if 
for all i,i in f7, there exists o" G G such that (j{i) = j. Likewise, G is 2-transitive 
(or doubly transitive) if for all i ^ i' and j j' in fi, there exists a G G such 
that a{i) = j and cr(i') = /. Quite generally, G is l-transitive for a positive integer 
Z < n, if for all pairwise distinct elements ii,i2, ■ ■ ■ ,ii in f2 and pairwisc distinct 
elements ji,j2, ■ ■ ■ ,ji in ^, there exists a € G such that a{ie) = je for 1 < e < /. 
This brings us to MTR, i.e., the method of 



4. Throwing away roots 

Assuming / to be irreducible in let us "throw away" a root of /, say ai, 

and get 

/i = h{Y) = J^^^ = + 6iy"-2 + . . . + bn-1 e K{ai)[Y]. 
{Y - ai) 

In continuation of (1), we see that. 

(2) / and /i are irreducible in K[Y] and if(ai)[F] respectively iffGal(/, if) is 
2-transitive. 

It may be noted that, assuming / to be irreducible, it does not matter which 
root of / we throw away; for instance, the irreducibility of /i in /ir(Q;i)[y] and, up 
to isomorphism, the Galois group Gal(/i, ii'(ai)) are independent of which root we 
call ai. 

Likewise, by throwing away s roots of /o = / we get 
/. = fs{Y) = ^^^^ 



(r-ai)---(r-a,) 
= y"-« + + --- + dn-s& K{ai, as)[Y] 

and then: 

(3) fs is irreducible in K{ai, . . . , Q!s)[y] for < s < Hff Gal(/, K) is Z-transitive. 



^Self-advice: Don't be so scared of the term "acting." It is simply the modern substitute for 
"permuting." 
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5. Classification theorems 

Now you would have thought that you could (easily or possibly) construct a 
polynomial / = /o, say of degree 20, such that /o, /i, • • • , /g are irreducible whereas 
/lo is reducible. But No! And that is the surprise of the century. You cannot!! So 
says the CT, i.e., the recently established Classification Theorem of Finite Simple 
Groups, which was a magnificent piece of "team work" . According to the staggering 
statistics as reported by Coach Gorenstein [G3], the CT took 30 years (1950-1980), 
100 authors, 500 papers, and 15000 pages! Yet several hundred more pages are 
required to prove the implications 

CT ^ CDT ^ CTT CQT CFT ^ CST 

where CDT (resp: CTT, CQT, CFT, and CST) stands for the Classification The- 
orem of Doubly (resp: Triply, Quadruply, Fivefold, and Sixfold) transitive permu- 
tation groups.^ Promising to come back to CDT to CFT in a moment, let us state 
CST. It simply says that the symmetric group Sn for n > 6 and the alternating 
group An for n > 8 arc the only sixfold transitive groups!! Vis-a-vis equations, 
what we are saying is that if / is an irreducible polynomial of degree n > 7 such 
that /i, /2, /a, /4 and are irreducible then so are fa, /t, • • • , /„-3-^ 

To take a first shot at CT, in addition to permutation groups, we should also 
consider groups of matrices over the (Galois) Field GF(g) of q elements where q is 
a power of a prime. So let 

Gh(rn,q) = the general linear group of degree m over GF((7) 
=the group of all nonsingular to by to matrices 
with entries in GF(g). 

Here the multiplicative group GF(q')* comes in two ways. Firstly, thinking of 
scalar matrices, GF((j')* becomes a normal subgroup (and, in fact, the center) 
of GL(to, g). Secondly, taking determinants we get a surjective homomorphism 
GL(m, g) — > GF(g')*. This motivates the definitions 

PGL(to, q) =the projective general linear group of degree to 
over GF(q) 
= GL(TO,g)/GF(g)* 

and 

SL(to, g) = the special linear group of degree to over GF(g) 
= ker GL(TO,g) ^ GF{q)*. 

^This is an expanded version of a lecture given at Walter Feit's 60"^ birthday conference in 
Oxford, England. In addition to Walter Feit, the audience included the group theorists Peter 
Cameron, Michael Collins, Sandy Green, Graham Higman, Peter Neumann, Ron Solomon, and 
John Thompson; my talking group theory in this meet of topnotch group theorists was like carrying 
coal to Newcastle, or bringing holy water to the Ganges!! 

■^Self-Challenge = challenge to the extoUers of high school algebra: prove that by high school 
algebra if you can! Of course we can simply decree CT be high school algebra!! 
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Combining these two roles of GF(q')* we get 

PSL(m, q) = the projective special linear group of degree m 
over GF(g) 
= SL(m, 9)/(SL(m, q) n GF{q)*). 

In group theory parlance^ 

Lm{q) = the linear group of degree m over GF(q') 
= FSL{m,q). 

In yet another notation 

Ar,{q) = Ln+i{q) = PSL(n + 1, q). 
With these preliminaries, 

CT essentially says that An and An{q) are the only (finite) simple groups. 

Here we have to exclude "small" cases; namely, from the alternating group An 
exclude n < 4, and from An{q) exclude n = 1 and q < 3 (we define An{q) only 
for n > 1). Moreover, "essentially" means that with A„(g) we have to include 
its relatives and incarnations, to be discussed later. Finally, in addition to these 
infinite families, there are 26 "sporadics", again to be discussed later. ^ 

6. Brief thirty year history 

First, in the fifties, there was the fundamental work of Brauer [B] and Chevalley 
[Ch]. In 1962 this was followed by the path-breaking odd order paper of Feit and 
Thompson [FT]. Then came the largo team coached by Gorcnstcin [G4]. At any 
rate, we are meeting here to felicitate our friend Walter Feit on the occasion of his 
forthcoming sixtieth birthday.'^ 

7. Primitive groups 

As another example of the parallelism between group theory and theory of equa- 
tions, let us note that a permutation group G is said to be primitive if it is transitive, 
and the one-point stabilizer Gi of G is a maximal subgroup of G. Here we are as- 
suming G to be a subgroup of the symmetric group Sn acting on {1, 2, . . . , n}, and 
then by definition, Gi = G fl 5„_i with Sn-i = {tr e 5„ : a{l) = 1}. Just as it did 
not matter which root of the irreducible equation wc threw away, so in the present 
situation, if G is transitive then we may replace Gi by any Gj = {a G G : a{i) = i}, 
which is called the stabilizer of i in G. Clearly. 

(4) If / is irreducible and G = Gal(/, K) then Gi = Gal(/i, K{ai)). 

''In learning group theory, I am following the traditional Indian method; memorize things by 
heart and the meaning will eventually be revealed to you. Moreover, every subject has its lingo. 
Thus GL, SL and PSL are the Tom, Dick, and Harry of group theory. 

®Does the number 26 vindicate the spread of the English language which has exactly that 
many letters? 

^My fondest memory of Walter is that in 1957, when wc were both at Cornell, we decided to 
go on a diet together and the one who lost more weight was to get a quarter. At the end of one 
month, Walter gained one pound and I gained two. Who lost and who won? 
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Moreover; 

(5) Gal(/, K) is primitive iff / is irreducible and there is no field between K and 

Kiai). 

Not to get completely lost in group theory, let us revert to algebraic geometry 
by talking about 

8. Fundamental groups 

In my 1957 paper on "Coverings of Algebraic Curves" in the American Journal 
[A3], I considered the algebraic fundamental group 7rA(C) of a nonsingular curve C. 
Here C is allowed to be "open", i.e., it may consist of a projective algebraic curve 
minus a finite number of points. Wc assume that C is irreducible and defined over an 
algebraically closed ground field k. By nA{C) we mean the family of Galois groups 
Gal{L/k{C)) as L varies over all finite normal extensions of the function field k{C) 
of C such that no point of C (equivalently, no valuation of k{C)/k having center 
on C) is ramified in L. 

In case C is the (affine) line Lk over fc, or more generally if C = Lk^j. = the 
line Lk minus r points Ai, . . . , (with A^ ^ Xj in k for 1 < i < j < r) then this 
amounts to considering Gal(F, k{X)) where 

F = F(x, y) = y" + + ■ • • + 0„(a) 

is a bivariate polynomial with coefficients ^i(X), . . . , (pni^) in k[X] such that F is 
unramified at all (finite) values of X other than Ai, . . . , A^, i.e., such that for every 
A in A; different from Ai , . . . , A^ we have 

F{X + X,Y)=ll{Y-rj^^\x)) 

with r]^^\x) in the (formal) power series ring fc[[A']]. We may call F an unramified 
covering of Lk^r- 

For any group G, let Gh denote the family of finite homomorphic images of G. 
Let J^r be the free group on r generators, and let Jr be the family of all finite 
groups generated by r generators, and note that then J,. = J^rh- By the Riemann 
Existence Theorem etc., we see that if /c = the field of complex numbers C, then 
for any (irreducible) nonsingular curve C over k we have 7r^(C) ~ T^i{C)h where, 
as usual, 7ri(C) denotes the (topological) fmidamcntal group of C; sec Serre [SI]. 
Hence in particular TTA{Lc,r) = '^i{Lc,r)h, and clearly 7ri(Lc,r) = ^r- Therefore 

T^AiLc.r) = Jr- 

By taking r = or 1 in the last equation, we get 7r^(Lc) = Jo = {1} and 
'^a{Lc,i) = Ji = the family of all finite cyclic groups. These two facts can also 
be proved purely algebraically, by the genus formula due to Hurwitz-Riemann- 
Zeuthen; see Serre [S2] . The said genus formula actually shows that 

7ri(ifc) = Jo = {1} 

and 

7r^(Lfe,i) =Ji[char k]* 

= the family of all finite cyclic groups with order 
nondivisible by char k, 
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where char k is the characteristic of k and where; for any (irreducible) nonsingular 
curve C over any algebraically closed field k we define 

7r^(C) = the globally tame fundamental group of C 

= the family of all the members of 7rA(C) whose order is 
nondivisible by char k 

and for any nonnegative integer m and any family of finite groups J we put 

J[m]* = the family of all the members of J whose order is nondivisible by m. 

9. Quasi ^-groups 
Given any prime number p we put 

Q{p) = the family of all quasi p-groups, 

where by a quasi p- group we mean a finite group which is generated by all of its 
p-Sylow subgroups, and for any finite group G we put 

p{G) — the (normal) subgroup of G generated by all of its p-Sylow subgroups 

and for any family of finite groups J we put 

J{p) = the family of all finite groups G such that G/p{G) G J 

and we note that then Jo{p) = Q{p) and for every nonnegative integer r we have 

Jr{p) =JrW*(p) 

= the family of all finite groups G such that G/p{G) is generated 
by r generators 

and 

{p{G) : G G Jrip)} - Q{p). 

10. Conjectures 

For any algebraically closed field k of characteristic p ^ 0, in the above cited 
1957 paper, I conjectured that iTA{Lk,r) = A{Lc,r){p) , i-e., equivalently, 

General Conjecture. For every nonnegative integer r we have TrA{Lk,r) = Jr{p)- 
Hence in particular 

Quasigroup Conjecture. iTA{Lk) = Q{p)- 

Now a (finite) simple group whose order is divisible by p is obviously a quasi 
p-group, and therefore QC (= the Quasigroup Conjecture) subsumes the 

Simple Group Conjecture. 7r^(Lfe) contains every simple group whose order is 
divisible by p. 

In particular 
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Alternating Group Conjecture. For every integer n> pwe have An & 7rA(ife) 
except when p = 2 < n < 5. 

Here, as usual, An denotes the alternating group of degree n, i.e., the group of 
all even permutations on n symbols, and we note that the order of An is n!/2 or 
1 according as n > 1 or n = 1. As proved by Galois, An is a simple group for 
every n > 4, and hence AGO (= the Alternating Group Conjecture) is a special 
case of SGC (= the Simple Group Conjecture) except when {p,n) = (3,4); in this 
exceptional case, A„ is obviously a quasi p-group and so we can directly fall back 
upon QC. The symmetric group Sn of degree n is obviously a quasi 2-group for 
every n, and hence QC subsumes the 

Even Prime Symmetric Group Conjecture. If p = 2 then for every integer 
n > 2 we have 5„ e iTAiLk)- 

To match up with EPSGC {— the Even Prime Symmetric Group Conjecture), 
let us divide AGC into EPAGC and OPAGC, i.e., into the following two conjectures 
respectively. 

Even Prime Alternating Group Conjecture. If p = 2 then for every integer 
n>2 except when 2 < n < 5 we have An G nA{Lk)- 

Odd Prime Alternating Group Conjecture. If p > 2 then for every integer 
n>pwe have An € WAiLk)- 

11. Again some history 

Let ^ o G fc = an algebraically closed field of characteristic p ^ 0, and let n, s, t 
be positive integers such that t ^ 0(p). Now in support of the above conjectures, 
in the above cited 1957 paper, I had written down the following two examples of 
unramified coverings of Lfc: 

F„ = y" - aX'Y^ + 1 with n=p + t 

and 

Fn = Y"- oY^ + X' with t<n = 0{p) and GCD(n, t) = \&nds = 0(i) 

and had suggested that their Galois groups G„ = Gal(F„, k{X)) and G„ = Gal(F„, k{X)) 
should be calculated. 

Now, after a gap of thirty years, with Serre's encouragement and with the help 
of CT, I can calculate these Galois groups, and the answers are as follows. 

(I) 

(1.1) If i = 1, then G„ = PSL(2,p) = PSL(2,n- 1). 

(1.2) If i = 2 and p = 7, then G„ = PSL(2, 8) = PSL(2, n - 1). 

(1.3) If t = 2 and p 9^ 7, then G„ = A„. 

(1.4) If t > 2 and p ^ 2, then G„ = 

(1.5) lip = 2, then G„ = 5„. 
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(11) 

(11.1) If 1< i < 4 and p ^ 2, then G„ = A„. 

(11.2) If 1 < t < n - 3 and p 7^ 2, then G„ = A„. 

(11.3) If 1 < t = n - 3 and p 7^ 2 and 11 7^ p 7^ 23, then G„ = An. 

(11.4) Ifl<f<4<n and p = 2, then G„ = A„ or Sn- 
{11.5) Ifl<i<n-3 and p = 2, then G„ = A„ or 5„. 
Actually, F„ is a slight generalization of the original equation 

F„ = - XF* + 1 with n=p + t 

written down in the 1957 paper. This equation F„ was discovered by taking a 

section of a surface extracted from my 1955 paper "Ramification of Algebraic Func- 
tions" in the American Journal [Al], which was the second part of my Ph.D. Thesis 
written under the able guidance of Oscar Zariski. In the 1956 paper in the Annals 
of Mathematics [A2], which was the first part of my Ph.D Thesis, I proved reso- 
lution of singularities of algebraic surfaces in nonzero characteristic. In the 1955 
American Journal paper, I was showing why Jung's method of surface resolution 
in the complex case docs not generalize to nonzero characteristic, because the local 
fundamental group, above a normal crossing of the branch locus, in the former case 
is abelian whereas in the latter case it can even be unsolvable. It was a surface 
constructed for this purpose whose section I took in the 1957 paper. 

Although the second equation F„ is also a slight generalization of an equation 
occuring in the 1957 paper but, amusingly, it got rediscovered in 1989 as a variation 
of the first equation f „ . 

Now (I.l) was originally proved by Serre and when he told me about it in Sep- 
tember 1988, that is what started off my calculations after a thirty year freeze! As 
a slight generalization of (I.l), in the case oi t = 1,1 can also calculate the Galois 
group Gn^q = Gal{Fn^q, k{X)) of the unramified covering of given by 

F„,q = y" - aX-^Y* + 1 with n = q + t, 

where q is any positive power of p, and it turns out that 

(III) 

(III.l) If i = 1, then G.a,q = PSL(2, q) = PSL(2, n - 1). 
Again, Fn,q is a slight generalization of the equation 

Fn,g = F" - Xy* + 1 with n = q + t, 

which also occurs in the 1957 paper. Note that then 
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12. Using MRT 

Applying MRT (= method of removing tame ramification through cyclic com- 

positums = so called Abhyankar's Lemma) to the one-point stabilizer of F„+i we 
get the monic polynomial of degree n in y with coefficients in k{X) given by 

= h{Y) {Y + hf - aX-'Y* with 7^ 6 e fc, 

where h{Y) is the monic polynomial of degree n—p in Y with coefficients in k given 

by 

_ (y + n + i)"+^-p-y"+^-p 

and we let G'^ = Gal{F^, k{X)). As an immediate consequence of (I) we now get 

(IV) 

Assuming that n + 1 ^ 0(p), in the following cases F^ gives an unramified 
covering of Lk with the indicated Galois group. 

(IV.l) If n + 1 - p = i > 2 7^ p and 6 = i and s = 0(p - 1) and s = 0(i), then 

(IV.2) If n + 1-p = t = 2 andp ^ 7 and 6 = t and s = 0(p- 1), then = 
(IV.3) If n = p + 1 and p > 5, then t can be chosen so that 1 < t < 2±i and 
GCD(p+l,i) = 1, and for any such t, upon assuming 6 = and s = 0{t{p+l—t)), 

we have ^ An- 

(IV.4) If n + 1 - p = t and p = 2 and 6 = t and s = 0(f), then G^ = S^- 



13. Unramified coverings 

We have the following four corollaries of calculations (I) through (IV). 

In Calculations (I) through (IV), we use a lot of Ramification Theory, or, equiv- 
alently, CS (= Cycle Structure). In addition to OS + MTR + MRT, in the proofs 
of Calculations (I) to (IV) we also use CT. In our original version of these proofs, 
the use of CT was heavy. Gradually the use of CT decreased, but could not be 
removed completely. However, by traversing a delicate path through calculations 
(I) through (IV) , we have arranged a proof of the First and the Second Corollaries 
independent of CT. 

First Corollary. OPAGC is true. Equivalently, for any n > p > 2, there exists 
an unramified covering of the affine line in characteristic p whose Galois group is 
the alternating group An of degree n. 

Second Corollary. EPSGC is true. Equivalently, for any n> p = 2, there exists 
an unramified covering of the affine line in characteristic p whose Galois group is 

the symmetric group Sn of degree n. 

Third Corollary. Unramified coverings of the affine line in characteristic p with 
a few more Galois groups have been constructed. 
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Definition-Remark. By the minimal index of a finite group G we mean the 
smallest number d such that G has a subgroup of index d which does not contain 
any nonidentity normal subgroup of G.^ Now QC is obviously equivalent to saying 
that for every integer d> p the following is true. 

QC{d). Every quasi p-group of minimal index d belongs to -KAiLk), i-e., occurs as 
the Galois group of an unramified covering of the affine line in characteristic p. 

Therefore it is interesting to point out that as a corollary of the above results 
we have the following: 

Fourth Corollary. QC{p+ 1) is true for every p which is not a Mersenne prime^ 
and which is different from 11 and 23. More generally, if G is a quasi p-group 
containing a subgroup H of index p+1 such that H does not contain any nonidentity 
norm,al subgroup of G, and if p is not a Mesenne prime and p is different from 11 
and 23, then there exists an unramified covering of the affine line in characteristic 
p having G as the Galois group. 

The proofs of the above four Corollaries and the four claims (I) to (IV) will be 
completely given in this paper with the exception that the proof of claim (1.2) will 
be completed in my forthcoming paper [A7]. In connection with (II. 4), it may be 
noted that the case oi p = 11 or 23 is still open.^° 



14. History of a pilgrimage 

When I said that "Now... I can calculate these Galois groups", what I really 

meant was that, from September 1988 to August 1989, I undertook a pilgrimage 
(physical as well as mental) to seek the help of lots of mathematicians, and then I 
simply collated the help so obtained. In chronological order, these mathematicians 
were. Scire (oh yes, very much Serre), Kantor, Feit, Cameron, Sathaye, Eakin, 
Stennerson, Gorenstein, O'Nan, Mulay, and Neumann. 

The pilgrimage started when in September and October of 1988, Serre sent me 
one after another four long letters briefly saying that 

"In your 1957 paper you suggested that the Galois group of should be calcu- 
lated. I can now prove that for t=l it is PSL(2,p). Can you calculate it for other 
values of tl Also, the conjectures in your paper include AGC. Can you now prove 
AGCT 

Fortunately, in his last letter, Serre added a sentence saying that "my e-mail 
is... ." 



*In other words, we are minimizing the index over subgroups of G which do not contain any 

minimal normal subgroups of G, where we recall that a minimal normal subgroup of a group G 
is a nonidentity normal subgroup N of G such that N does not contain any nonidentity normal 
subgroup of G other than A' itself. 

Mersenne prime is a prime p of the form p = 2'^ — 1 for some positive integer /i. Note that 
then // is necessarily prime, because otherwise by factoring fi = fJ-' fJ." with fi' > I and //" > 1, 

we would get a factorization 2'' - 1 = I' I" with = 2''' - 1 > 1 and I" = 1 + 2''' + 22^' -| \- 

2m'(m"-i) > 1. 

As will become apparent later, the reason for this, as well as for the exclusion of these values 
of p from the Fourth Corollary, is the existence of the "Mathieu Groups" . 
mental = e-mail + s-mail. s-mail = snail mail = usual mail. 
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15. Multiply transitive groups 

Having already ciommented on the significance of transitivity for Galois theory, 
before proceeding further with calculations of Galois groups, let us give a brief 
review of multiply transitive groups. 

So let G be a permutation group, say of degree n, i.e., let G act on Q = 
{1,2,..., n}. In analogy with the concept of transitivity introduced in §3, we say 
that G is X-antitransitive (or A- fold antitransitive) for a positive integer A < n, if for 
all pairwise distinct elements ii, Z2, . . . , u in we have that the identity is the only 
member of G which keeps them fixed. Moreover, for positive integers I < X < n, 
we say that G is {I, X)-transitive if G is i-transitive and A-antitransitive; we may 
express this by simply saying that G is (^,A).-^^ Finally, G is sharply l-transitive 
means G is {1,1). Now if G is Z-transitive, with / > 1, then the one-point stabilizer 
of G is obviously (/ — l)-transitive as a permutation group of degree n — 1, acting on 
the "remaining" n — 1 clcmcints: conversely, if G is transitive and its one-point sta- 
bilizer is {I — l)-transitive then G is /-transitive. Similarly, if G is A-antitransitive, 
with A > 1, then the one-point stabilizer of G is (A — l)-antitransitive as a permu- 
tation group of degree n — 1, acting on the "remaining" n — 1 elements; conversely, 
if G is transitive and its one-point stabilizer is {I — l)-antitransitive then G is l- 
antitransitive. Thus, to classify all A) groups, we can make induction and each 
time increase I, A, and n by one. So we start with (1, 1). 

By definition, G is regular means G is (1,1). Now the classification of (1,1) 
groups is either obvious or impossible. Obvious because it is so easy to define what 
is (1, 1). Indeed, the usual proof of the usual theorem which says that every finite 
group is a permutation group, amounts to representing the given group as a regular 
permutation group. Hence impossible because it would amount to classifying all 
finite groups. 

G is Frobenius means G is (1, 2) but not (1, 1). A prototype of a Frobenius group 
is the group of all affine linear transformations ax + h with a, 6 in a finite field, or 
more generally in a finite near-field. Zassenhaus [Z2], in his 1936 Thesis written 
under Artin, proved that the converse is true for (2, 2) groups. 

Zassenhaus' Theorem. G is (2,2) G = AGLNF(1,^') for some finite near- 
field 

Here, by AGLNF(1,\E') we are denoting the group of all affine linear transfor- 
mations of degree 1 over the near-field where "near-field" is a generalization 
of "field" obtained by weakening the distributive law. Namely, a near-field is an 
additive abelian group \1/ in which the nonzero elements form a multiplicative group 
such that for all a, 6, c in ^ we have a{b + c) = ab + ac. Thus we are not assuming 
the other distributive law (b + c)a = ba + be which is not a consequence of the first 
distributive law because the multiplication is not required to be commutative. It 
can easily be seen that, as in the case of a field, the number of elements in any 
finite near-field is a power of a prime number. By an affine linear transformation 



^^That is, if (7 6 G is such that a{ie) = ie for 1 < e < A then we must have (7 = 1. 

^^Note that if G is i-transitive and m-antitransitive for positive integers I < n and m < n, 
then automatically I < m. Also note that if G is Z-transitive for some positive integer / < n then 
G is i'-transitive for every positive integer I' < I. Likewise, if G is A-antitransitive for a positive 
integer A then G is A'-antitransitive for every positive integer A' > A with A' < n. 

^■^By making G act on itself by right or left (but not both) multiplication. 
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of degree 1 over the near-field ^' we mean a map 5* ^ 5* given hy x <—>^ ax + b with 
a, 6 in ^ and a ^ 0. It can easily be seen that distinct (a, b) give distinct maps 
^ ^. A proof of Zassenhaus' Theorem is given in 9.10 on page 424 of volume III 
of Huppert-Blackburn [HB].^^ In the proof of this Theorem, as well as in the proofs 
of various other theorems on multitransitive groups, an important role is played by 
the following Theorem of Probenius (1901) [Fr] for a proof of which we refer to 8.2 
on page 496 of volume I of Huppert-Blackburn [HB] . 

Frobenius' Theorem. A Frobenius group G always has a, (1, 1) normal subgroup. 
More precisely, the subset ofG consisting of the identity together with those elements 
a which fix no letter {i.e., a{i) ^ i for i = 1,2, ...,n) forms a regular normal 
subgroup ofG. 

In the notation AGLNF(1, ^'), the letters NF arc meant to remind us of a near- 
field. In case * = a field $, we may write AGL(1, $) instead of AGLNF(1, *). 

To put the notation AGL(1,$) in proper perspective, first we remark that for 
a field $ and a positive integer m the groups GL(m, $), PGL(m, $), SL(to, $), and 
PSL(m, <&) are defined by replacing GF{q) by $ in §5.^'' Note that Z{GL{m, $)) = 
the set of all scalar matrices, and Z{SL{m, $)) = SL(m, $) fl Z{GL{m, $)), where 
the center of any group F is denoted by Z{T), i.e., Z{T) is the normal subgroup 
of r given by putting Z{r) = {a £ T : ab = ba for all b G F}. Now a nonsingular 
m by m matrix a G GL(m, $) corresponds to the bijection $™ which sends 

any 1 by m matrix ^ G 'I''" to the matrix product G In this manner, 

the group GL(to, <&), and hence also the subgroup SL(to, may be regarded as a 
permutation group on Let ■p($'") be the (m — l)-dimensional projective space 
over where we think of as the set of all one-dimensional subspaces of <!>'". 

Now the bijection ^"^ corresponding to any a G GL(m, clearly induces a 

bijection 'P($'") —>■ moreover, if a* € GL(m, differs from a by a scalar 

matrix, then a and a* induce the same bijection Thus the group 

PGL(m, and hence also the subgroup PSL(to, <&), becomes a permutation group 
on Members of PGL(m, $) are called projective transformations of 'P($™). 

Note that for any 7 G PGL(m, $) and C G we have 7(C) G 'P($'"). 

Now the affine general linear group AGL(to, $) of degree m over $ may be 
introduced as the semidirect product x GL(m, $) of by GL(m, $) with 
the obvious action of GL(to, $) on where we recall that a group F is said 
to be the (internal) semidirect product of a normal subgroup 8 by a subgroup A 
provided F = 6 A and O n A = 1, and we note that in this case A acts on O by 
conjugation. Concretely, AGL(to, $) may be regarded as the set of all m by m -I- 1 
matrices whose entries are in the $ and whose m by m piece is nonsingular; in other 
words, we think of AGL(m, $) as the set of all pairs (a, (3) with a G GL(to, $) and 
j3 g where multiplication is defined by {a, f3){a' , P') = {aa',l3a' + /3').i^ To 
(a,/3) G AGL(m, $) there corresponds the bijection — > which sends every 

-"^^ Although Dickson found all finite near-fields, it was left to Zassenhaus to prove that there 
were no more. E. H. Moore [Mo] of the newly opened University of Chicago classified finite fields, 
around 1895, and then, around 1905, his students Wedderburn [W] and Dickson [D2] studied skew 
fields and near-fields respectively. 

^^Writing q for GF(q') is justified because for any prime power q, up to isomorphism, there is 
exactly one field with q elements. 

^'^This is an example of an "external" semidirect product. For further elucidation sec Suzuki 
[Su2], Huppert-Blackburn [HB], and Wielandt [Wi]. A lot of the group theory background, re- 
quired in this paper, I learned in the last two years from these nice books. I highly recommend 
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^ G to £,a + P G $m_i8 'pjjyg AGL(m. 5^) also becomes a permutation group on 
Members of AGL(m, $) are called affine transformations of 
To make another example of a semidirect product, let Aut $ be the group of all 
automorphisms of the field For every matrix 7 with entries in <I> and for every g G 
Aut let 7^ be the matrix obtained by applying g to each entry of 7. This gives an 
action of Aut on GL(m, <1>). The semidirect product GL(to, <I>) x Aut $ is denoted 
by rL(m, $) and members of rL(m, $) are called semilinear transformations of 

A member of F L(m, i>) may be thought of as a pair (g, a) with 5 G Aut $ and 
a G GL(Tn,'I>), and the corresponding bijection — > sends every ^ G i*"* to 

G <1'™. The multiplication in FL(to, $) is given by (g', a)(gi', a') = {gg',a^ a'). 
Thus FL(m, $), which may be called the semilinear group of degree m over 
acts on With this action we can form the semidirect product ArL(m, $) = 

XI rL(rn, and call it the affine semilinear group of degree m over <I>. A 
member of ArL(m, $) is called an affine semilinear transformation of and it 
may be represented as a triple {g, a, f3) with g G Aut a G GL(m, $), /3 G the 
corresponding bijection $™ ^ $™ sends every ^ G $™ to + /3 G "I"'", and the 
multiplication in A FL(to,$) is given by {g,a, P){g' ,a' , P') = {gg' ,a^' a' , f}^' a'+(3'). 
Thus ArL(TO, <&) is a permutation group on $™, and in a natural manner we have 

SL{m, $) <i GL(to, < F L(to, $) < A F L(m, $) 

and 

GL(m,<I>) < AGL(m,$) < AFL(m,$), 

where < and < denote subgroup and normal subgroup respectively. 

To construct one more example of a semidirect product, we note that the ac- 
tion of Aut $ on GL(to, <&) obviously induces an action of Aut $ on the factor 
group PGL(m, $). With this induced action we form the semidirect product 
PrL(m, $) = PGL(m, $) xi Aut$ and call it the projective semilinear group of 
degree m over $. A member of P F L(to, $) is called a projective semilinear transfor- 
mation of and it may be represented by a pair (7, g) with 7 G PGL(m, and 
g G Aut the corresponding bijection 'P($™) —>■ 'P($™) sends every C G V{^™) to 
7(C^) G where G is given by C = {^^ ■ ^ (z Thus PFL(m,$) 
becomes a permutation group on and in a natural manner we have 

PSL(to, < PGL(m, $) < P F L(m, 

Although we have spoken of SL(?7?, $), GL(r7;, $), F L(?77, $), AGL(?77, $), A F L(m, $) 
as permutation groups on (J?™, and PSL(m, PGL(m, P F L(m, $) as permu- 
tation groups on P ($'"), this is relevant mainly when i> = GF(5) for some prime 

them. It may be noted that a Frobcnius group G is the semidirect product of the Frobcnius kernel 
of G by a Frobcnius complement of G, where by the Frobenius kernel of G we mean the (1, 1) 
normal subgroup of G, and by a Frobenius complement of G we mean a 1-point stabilizer of G. 

^®In the above case of m = 1, this reduces to x t-^ ax + b hy taJiing i; = ^€^>, 0^a = a€$, 
and 6 = /? e 

^^Transformations, or substitutions, of the type x' = ax + by + c and y' = a'x + b'y + c' are 
familiar to us from high school. By adding "points at infinity" to the ordinary plane we get the 
projective plane. To distinguish between the ordinary plane and the projective plane, the ordinary 
plane is called the aiBne plane and the above transformations are called affine transformations of 
the plane . To know that they can be redefined in terms of semidirect products should help to 
maJje this notion friendly. 
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power q and in that case we may write SL(to, g),GL(m, g), rL(m, (7), AGL(m, g), 
rL(TO, q), PSL(to, g), PGL(m, q), and P rL(TO, g) for SL(m, $), GL(m, $), rL(m, $), 
AGL(m,$), ArL(m,$), PSL(m,$), PGL(m,$), andPrL(m,$) respectively. 

Henceforth by a permutation group we shall again mean a permutation group on 
a finite set. 

16. Zassenhaus groups 

Having talked about (1,1), (1,2), and (2,2) groups, let us now discuss (2,3) 
groups which are not (2, 2). Basically they fall into the following three classes. 

(i) A Feit group is defined to be a (2, 3) group which is not (2, 2) but has a (1, 1) 
normal subgroup. As a prototype we have the group A F L(l, 2^') where p is a prime 
number. This consists of all transformations x ^ ax^ + b with 7^ a € GF(2^') and 
b G GF(2P) and g G Aut GF(2p). Now | GF(2p)| = 2p and |Aut GF(2p)| = p where 
I I denotes cardinality. Thus AFL(1,2p) is a permutation group of degree 2^ and 
order 2^(2^ - l)p. 

(ii) A sharp Zassenhaus group is defined to be a (3, 3) group; such a group is 
clearly a (2, 3) group; moreover, it is a (2, 2) group only when its degree is 3 and in 
that case it is simply S3. The Fundamental Theorem of Projective Geometry says 
that, on the projective line over a field any three points can be sent to any other 
three points by one and only one projective transformation. In case, $ = GF((j'), 
where q is a prime power, this amounts to saying that the group PGL(2, q) is a 
(3, 3) group, where we regard PGL(2, q) as a permutation group of degree q + 1;^^ 
it is easily seen that PGL(2,(7) has no (1,1) normal subgroup; clearly the order 
of PGL(2,(jr) is (q + l)q{q — 1).^^ It can be shown that, if q is an even power of 
an odd prime, PrL(2,g) has exactly one 3-transitive subgroup, which we denote 
by PML(2,g), such that PML(2,g) PGL(2,g) and PSL(2,g) is a subgroup of 
PML(2, q) of index 2. Now PML(2, q) is also a (3, 3) group, where we again regard 
it as a permutation group of degree g+1; it is easily seen that PML(2, q) has no (1, 1) 
normal subgroup provided > 4; clearly the order of PML(2, q) is {q+l)q{q — l); we 
call PML(2. q) the projective mock linear group of degree 2 over GF((jr). For further 
discussion about PML(2, q) see page 163 of volume III of Huppert-Blackburn [HE] 
where it is denoted by M{q). 

(iii) A strict Zassenhaus group is defined to be a (2, 3) group which is neither 
(2,2) nor (3,3) and does not have any (1, 1) normal subgroup. It can easily be 
seen that, for any odd prime power q, the group PSL(2, q) is a strict Zassenhaus 
group of degree q+ \. To find the order of this group, we might as well start by 
calculating the order of GL(m, q) for any positive integer m and any prime power 
q which need not be odd. Now the number of ways of choosing the first column of 



"That is g is a positive integral power of a prime number. Clearly then q is an odd prime 
power or an even prime power according as the corresponding prime number is even or odd. When 
q is an odd prime power, it can be an even power of an odd prime or an odd power of an odd 
prime. 

■^1PGL(2, q) acts on the projective line over GF(q) which has q + I points on it, out of which 
q are at "finite distance" and one is the point at "infinity". 

Obviously the order of any {l,Vj group of degree n is n(n — 1) • • • (n — / + 1). 

^^In Gorenstein's book [Gl] every (2, 3) group which is not (2, 2) is called a Zassenhaus group. 
In Huppert and Blackburn's book [HB] every (2, 3) group which is not (2, 2) and does not have 
any (1, 1) normal subgroup is called a Zassenhaus group. We are calling the groups mentioned in 
(i) Feit groups because they were completely characterized by Feit [F] in 1960. 



GALOIS THEORY ON THE LINE IN NONZERO CHARACTERISTIC 15 

an element of GL{m,q) equals | GF{q)"^ \ — 1 = — 1. The number of multiples 
of the first column is q and hence, having chosen the first column, the number of 
ways of choosing the second column equals | GF{q)"^ \ — \ GF(g)| = q"^ — q. More 
generally, the first i columns generate an i-dimensional vector space over GF((j') 
and hence, having chosen the first i columns, the number of ways of choosing the 
{i + l)th columns equals | GY{q)'^\ - \ G¥{qy\ = - q\ Therefore 

I GL(m, q)\ = (g™ - l){q^ - q) ■ ■ ■ (g™ - 

Consequently 

|AGL(m,g)| = |GF(g)™||GL(m,g)| =g'"(g™-l)(g'"-g)---(g™-g™-i). 

Now I GF{q)* I = q — 1, where GF(g)* is the multiplicative group of nonzero elements 
of GF{q), and we have obvious exact sequences of finite groups 1 SL(m, g) — > 
GL{m,q) GF{q)* ^ 1 and 1 ^ GF(g)* ^ GL{m,q) PGL(m,g) ^ 1, and 
hence 

|PGL(m,g)| = |SL(m,g)| = \GL{m,q)\/{q - 1) 

Let 

=the cyclic group of order v where i/ is either a positive 
integer or oo. 

Now clearly 

Aut GF(g') = Zfj, where q = p'^ with p = char A; 

and hence 

|rL(m, q)\/\ GL(m, q)\ = | AFLK q)\/\ AGL(m, q)\ 

= |PrL(TO,g)|/PGL(TO,g)| = /x. 

Let SL(m, q)* be the group of all m by to scalar matrices whose entries are in GF(q') 
and whose determinant is 1. Then SL(to, q)* is isomorphic to the group of all TOth 

roots of 1 in GF(<7) and hence | SL{m,q)*\ = GCT)(m,q — 1). Also we have an 
obvious exact sequence of finite groups 1 SL(to, q)* SL(to, q) PSL(to, q) — » 
1 and hence 

I PSL(to, g)| = I SL(m, q)\/ GCD(to, q - 1) 

= I PGL(to, q)\/ GCD(to, q - 1). 

So in particular 

|PSL(2,g)| = (g+l)g(g-l)/2 or {q + l)q{q-l) 



according as q is odd or even i.e., according as g — 1 is or is not divisible by 2. If 
q is even then we cannot divide g — 1 by 2, and so we do the best we can; namely. 
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assuming g to be a square of a proper odd power^** of 2, in the expression {q+l)q(q — 
l)/2 we replace {q - l)/2 by (g^/^ - 1) to get the expression {q + l)q{q^^'^ - 1), 
and now to get rid of the fractional power we write q in place of g^/^. For every q 
which is a proper odd power of 2, the resulting expression (g2 + _ 1) is the 

order of a certain strict Zassenhaus group Sz(g) of degree + 1; this group Sz{q) 
is isomorphic^^ to a certain subgroup of GL(4, g) and, since it was discovered by 
Suzuki [Sul] in 1962, it is called the Suzuki group over GF((7). We may think of 
Sz(g) as ersatz PSL(2,g^); we have just given a heuristic reason for its existence 
and a mnemonic device for remembering its order; to recapitulate 

I Sz(g)| = (g^ + l)g^(g — 1) if g is any odd power of 2. 

As hinted in the above order formula, the definition of the Suzuki group Sz(g) can 
be extended so as to include the case of g = 2; in this case we still get a 2-transitive 
permutation group of degree g^ + 1 and of the above order, which is however not 
a strict Zassenhaus group; indeed, as a permutation group, Sz(2) is isomorphic to 
the (2,2) group AGL(1,5). 

The following theorem of Zassenhaus [Zl], Feit [F], and Suzuki [Sul] says that 
the above examples of (2, 3) groups which are not (2, 2) are exhaustive; for a proof 
see 1.1 and 11.16 on pages 161 and 286 of volume III of Huppert-Blackburn [HB]. 

Zassenhaus-Feit-Suzuki Theorem. For a permutation group G we have the 
following. 

(1) G is Feit 4^ G = ArL(l, 2^') for some prime p. 

(2) G is sharp Zassenhaus <^ G = PGL(2, g) for some prime power q, or G = 
PML(2, g) for some even power q of an odd prime. 

(3) G is strict Zassenhaus G = PSL(2, g) for some odd prime power q, or 
G = Sz(g) for some proper odd power q of 2. 

Moreover, this gives an exhaustive and mutually exclusive listing of (2, 3) groups 
which are not (2,2), with the proviso that the group PSL(2,2) = 53 is included in 
item (2) even though it is a (2,2) group in addition to being a (3,3), and hence a 
(2,3), group. 

17. More about classification theorems 

By analyzing the Suzuki group Sz(g), a certain analogous 2-transitive permuta- 
tion group i?i(g) of degree g^ -|- 1, for every odd power g of 3, was discovered by 
Ree [R] in 1964; although i?i(g) is 2-transitive, it is not 3-antitransitive; the group 
i?i(g) is defined in terms of some 7 by 7 matrices over GF(g) and is called the Ree 
group over GF(g); the order of i?i(g) is given by 

|i?i(g)| = (g^ + l)g^(g — 1) if g is any odd power of 3. 

For every odd power g of 2, in terms of certain matrices over GF(g), Ree defined a 
group R2{q) which is also called the Ree group over GF(g); the order is now given 

by 

\R2{q)\ = <z''(g' + 1)(9' - l)(g' + l){q - 1) 



That is with an odd exponent > 1. 
Not as a permutation group. 
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if q is any odd power of 2. All the Suzuki groups and all the Ree groups turn out 
to be a simple groups except: Sz(2) is solvable; i?i(3) is isomorphic to PrL(2,8) 
and hence the simple group PSL(2, 8) may be regarded as a normal subgroup of 
i?i(3) of index 3 and then PSL(2,8) is the unique minimal normal subgroup of 
i?i(3); the commutator subgroup -R2(2)' of i?2(2) is a (normal) subgroup of i?2(2) 
of index 2, and the said commutator subgroup i?2(2)' is actually a simple group 
called the Tits group. Thus we have the following three infinite families of finite 
simple groups deduced from matrices over GF(q'). 

Suzuki and Ree groups. Sz(g) for every proper odd power q of 2. Ri(q) for 
every proper odd power q of 3. R2{q) for every proper odd power q of 2, together 
with the commutator subgroup i?2(2)'. 

Just before Suzuki and Ree found these groups, Steinberg [St], in 1959, discovered 
that some known groups together with some further ones suggested by them could 
be organized into four other infinite families of finite simple groups which are defined 
in terms of matrices over GF{q'^) and which are now labeled as follows. 

Steinberg groups. ^^„(g), ^£>„(g), ^1)4(9), and ^£^6(3)- Here n is any positive 
integer and q is any prime power except: in case of ^An{q) exclude {n,q) = 
(1,2), (1,3), (2,2), and in case of ^D„(g) exclude n = 1. 

These 3+4 = 7 families are the "twisted incarnations" of the original nine infinite 
families of "relatives" of An{q)- These nine infinite families of finite simple groups 
defined in terms of matrices over GF(g) were systematized by Chevalley [Ch] in 
1955, and they are labeled as follows. 

Chevalley groups. A„ (g) , B„ (g) , C„ (g) , Z)„ (g) , (g) , £^7 (g) , -^s (g) , -F4 (9) , and 62 (g) . 
Again here n is any positive integer and q is any prime power except: in case of 
An{q) exclude (n, g) = (1,2), (1,3); in case of Bn{q) and Cn{q) exclude (n, g) = 
(1,2), (1,3), (2,2); in case of Dn{q) exclude n < 4; and in case of G2{q) exclude 
g = 2. 

Thus we have 7 + 9 = 16 infinite families of finite simple groups defined in terms 
of matrices over finite fields. It may be noted that, just as the group A„(g) = 
PSL(n + l,g) is obtained by projectivizing the special hnear group SL(n + l,g), 
the group C„(g) = PSp(2n, g) is obtained by projectivizing the "symplectic group" 
Sp(2n,g), and the groups Bn{q) = PQ{2n+ l,g) and Dn{q) = P0"'"(2n,g) are 
obtained by projectivizing the "commutator groups" 0(2n + l,g) and f2+(2n, g) 
of the "orthogonal groups" 0(2n + l,g) and 0'''(2n, g) respectively. The groups 
An{q), Bn{q), Cn{q),Dn{q) are collectively called "classical groups", and the re- 
maining Chevalley groups E(i{q),ET{q),Es{q),F4{q),G2{q) are collectively called 
"exceptional groups." 

Likewise, amongst the Steinberg groups, the group ^A„(g) = PSU(n + l,g) 
is obtained by projectivizing the "special unitary group" S\J{n + l,g), and the 
group ^_D„(g) = Pil~{2n,q) is obtained by projectivizing the "commutator group" 
fi~(2n, g) of the "orthogonal group" 0~(2n, g). The Suzuki and Ree groups have 
the alternative labels Sz(g) = ^B2(g), -Ri(g) = ^G2(g), and i?2(g) = ^-F4(g). The 
groups ^A„(g), ^^2(5), ^Dn{q), ^D4{q) may collectively be called the "twisted 



^^When q = a, prime p, many of these were already studied by Jordan [Jl] in the last century. 
For general q, some of them were discussed by Dickson [Dl] at the turn of the century. Likewise 
the first two families of Steinberg groups were already known to Jordan and Dickson, while the 
last two were independently found by Tits. 
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classical groups", and the groups ^EQ{q), ^F4{{q), ^^2(5) may collectively be called 
the "twisted exceptional groups." 

Now the projective mock linear group PML(2, q) may also be called the pre- 
mathieu linear group of degree 2 over GF(<7). The reason for this nomenclature is 
that, around 1865, Mathieu [Mat] found a transitive extension of PML(2, 9) which 
is denoted by Mn. By a transitive extension of a permutation group of degree n 
we mean a transitive permutation group of degree n + 1 having the given group as 
a one-point stabilizer. Note that Mn is a (4,4) group of degree 11 and hence its 
order is 11 • 10 • 9 • 8. Mathieu also found a transitive extension of Mn which we 
denote by M12. Clearly M12 is a (5,5) group of degree 12 and hence its order is 
12 • 11 • 10 • 9 • 8. The permutation groups Mn and M12 are called Mathieu groups of 
degree 11 and 12 respectively. It can be shown that M12 has a noninner automor- 
phism a of order 2 such that a(Mii) is 3-transitive of degree 12. The permutation 
group a(Mii) may be regarded as an incarnation of the Mathieu group Mn and 
we may denote it by Mu. To recapitulate 

IM12I = 12- 11 • 10- 9 • 8 and |Mii| = |Mii| = 11 • 10 • 9 • 8. 

To introduce the remaining three groups discovered by Mathieu, let us first note 
that for any positive integer m and any prime power q we have | GF(g)'"| = q'^ and 
hence for the corresponding (m — l)-dimensional projective space we have 

|7'(GF(q)™)| = (q™ - l)/(g - 1) = + + + 1. 

So in particular |-p(GF(4)3)| = 21 and hence PSL(3,4), which acts onP(GF(4)3), 
is a permutation group of degree 21; it can easily be seen that it is 2-transitive but 
not 3-transitive; by the above order formula we also get 

I PSL(3, 4)1 = (4^ - 1)(4^ - 4)4V GCD(3, 3) = 21 • 20 • 48. 

Mathieu obtained a transitive extension M22 of PSL(3,4), a transitive extension 
M23 of M22, and a transitive extension M24 of M23. Clearly M22 is a 3-transitive 
but not 4- transitive permutation group of degree 22, M23 is a 4- transitive but not 
5-transitive permutation of degree 23, and M24 is a 5-transitive but not 6-transitive 
permutation group of degree 24. These groups are called Mathieu groups of degree 
22, 23, 24 respectively, and obviously their orders are^^ IM22I = 22 • 21 • 20 • 48; 
IM23I = 23 • 22 • 21 • 20 • 48, and \M24\ = 24 • 23 • 22 • 21 • 20 • 48. Note the striking 
similarity between the numbers (2,9) occuring in the "parent group" PML(2,9) of 
Mil and M12, and the numbers (3,4) occuring in the "parent group" PSL(3,4) of 
M22, M23, and M24. All of the five Mathieu groups Mu, M12, M22, M23, and M24 
turn out to be simple groups. One hundred years after their discovery, during 1965 
to 1975, twenty-one other finite simple sporadic groups, i.e., those which do not 
naturally fit in any infinite family, were discovered by various people; the largest of 
the 21 -h 5 = 26 sporadic groups is called the monster and its order is 

246 . 320 . 59 . 76 . . -^33 . 17 . 19 . 23 ■ 29 • 31 ■ 41 • 47 • 59 • 71; 

for details see Gorenstein [Gl]. We are now ready to state, of course without proof, 
the Classification Theorem of Finite Simple Groups. 

^^For a transitive permutation group G of degree n we clearly have \G\ = n\Gi\ where Gi is 
a one-point stabilizer of G. 

^*See Gorenstein [G2] or Aschbacher [As]. At least one part of this extremely long proof, 
namely Mason's paper on quasi-thin groups [Mas] , is still to see the light of day! 
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CT. The following is a complete list of finite simple groups. 

(1) The cyclic group Zp for every prime p. 

(2) The alternating group An for every integer n > 5. 

(3) The sixteen infinite families of "matrix" groups mentioned above?^ 

(4) The twenty-six sporadics including the five Mathieus. 

Just as PSL(m, q) is the typical example of a finite simple group, soPGL(2, q) 
is the quintessential example of a 3-transitive permutation group. As obvious vari- 
ations of this, additional 3-transitive permutation groups are obtained by taking 
into account all the groups between PGL(2, q) and P P L(2, q) for every prime power 
q, and all the groups between PML(2,(7) and PrL(2,g) for every even power q of 
an odd prime; both these types have degree g -I- 1 . All these arise out of the fact 
that, by a projective transformation, any 3 points of a projective line can be sent 
to any other 3. Going to higher dimensions, by a projective transformation, any 
4 points of a projective plane can be sent to any other 4, and any 5 points of a 
projective 3-space can be sent to any other 5, and so on. This would give rise to 
4-transitives, 5-transitives, and so on. But there is a flaw. 3 collinear points in a 
projective plane, or in a projective 3-space, cannot be sent to 3 noncollinear points. 
4 coplanar points in a projective 3-space cannot be sent to 4 noncoplanar points. 
And so on. Thus, because of questions of linear independence, for every integer 
m > 1 and every prime power q, the group PGL(m, q), instead of being (m -I- 1)- 
transitive, is only 2-transitive, unless every line contains only 2 distinct points, in 
which case it would be 3-transitive. Well, for q = 2, & line should contain only 2 
points. But even that is so only in the afSne case because then we don't have the 
point at infinity. Thus, it is not PGL(m, 2) which is 3-transitive, but AGL(r/7. 2). 
We can see that AGL(m, 2) is, however, not 4-transitive unless m = 2 in which case 
we actually have AGL(2, 2) = S4. 

Now 

AGL(m,2) = GF(2)'" x GL(m,2) = 2™ • L„(2), 

where a dot stands for the semidircct product symbol and 2™ stands for (^2)™, 
i.e., for the direct product Z2 x Z2 x ■ ■ ■ x Z2 of m copies of Z2. By the order 
formula we have 

|L4(2)| = (24 - 1)(24 - 2)(24 - 4)(24 - 8) = 8!/2 = \As\. 

Hence, by the philosophical principle that two finite sim,ple groups of equal order are 
usually isomorphic, we expect that .^4(2) « As where « stands for isomorphism, 
and this can, in fact, be easily proved. Note that .^4(2) = GL(4, 2) is a 1-point 
stabilizer of AGL(4, 2) and hence in this incarnation Ag is only 2-transitive;^° let 
us denote this incarnation by As-^^ In a natural manner, A7 may be regarded 
as a subgroup of As and then it turns out that the image of Ay under the said 



^'^l am using the more friendly term "matrix groups" instead of the awe inspiring "Lie type 
groups." 

^"By taking the stabihzer at the origin, GL(m, q) becomes the 1-point stabihzer of AGL(m, q) 
for every integer m > 1 and every prime power q. Likewise, by taJfing the stabilizer at the point 
at infinity, AGL(1, q) may be regarded as the 1-point stabilizer of PGL(2, q) for every prime power 
?• 

^^That is, as a permutation group, As = GL(4, 2) = PGL(4, 2) C 5i6. 
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isomorphism is also only 2-transitivc; lot us denote the said image by yly.^^ The 
corresponding subgroup of 2^ • = AGL(4, 2) may be denoted by 2^ ■ At, this is a 
3-transitive but not 4-transitive permutation group of degree 2^ and order 2^ • 7!.^^ 
As a consequence of CT, it can be shown that there are no more 3-transitive 
permutation groups other than those we have already listed. In other words we 
have the following detailed version of CTT, i.e., the Classification Theorem of Triply 
Transitive Permutation Groups; this theorem was compiled from conversations with 
Cameron. Neumann, and O'Nan. 

CTT or Refined Fundamental Theorem of Projective Geometry. The 

following is a complete list of 3-transitive permutation groups. 

(1) For every prime power q, each group between PGL(2,q') andPTL{2,q) is a 
3-transitive permutation group of degree q + 1, and we have \ PGL(2, q)\ = 

I PrL(2, = {q+l)q{q— 1) where q = p'^ with p = char GF((7). Among 
these, PGL(2,3) and PrL(2,4) are the only groups which are A-transitive, 
and for them we actually have PGL(2, 3) = S4 and P F L(2, 4) = 6*5 . 

(2) For every even power q of an odd prime, each group between PML(2, q) and 
PrL(2, q) is a 3-transitive but not A-transitive permutation group of degree 
q + l, and we have |PML(2,g)| = | P rL(2, = {q -\- l)q{q - 1) where 
q= p^ with p = char GF(g'). 

(3) For every integer m> 1, the group AGL(m, 2) is a 3-transitive permutation 
group of degree 2™ and order 2'"(2™ - 1)(2™ - 2) • • • (2™ - 2™-i). This is 4- 
transitive only for m = 2, and in that case we actually have AGL(2, 2) = 5*4. 

(4) The group 2* -A-y is a 3-transitive but not A-transitive permutation group of 
degree 2^ and order 2^-7!, and as a permutation group it is a subgroup of 
24-l8 = AGL(4,2). _ 

(5) The reincarnated Mathieu group Mn is a 3-transitive but not A-transitive 
permutation group of degree 12 and order 11 • 10 • 9 • 8 = 12 • 11 • 10 • 6. 

(6) The Mathieu group M22 and its autom,orphism group Aut M22 are 3-transitive 
but not A-transitive permutation groups of degree 22 with IM22I = |AutM22|/2 = 
22 -21 -20 -48. 

(7) The Mathieu groups Mn and M23 are A-transitive but not 5-transitive per- 
mutation groups of degree 11 and 23 and order 11-10-9-8 and 23-22-21-20-48 

respectively. 

(8) The Mathieu groups M12 and M24 are 5-transitive but not 6-transitive per- 
mutation groups of degree 12 and 24 and order 12 • 11 • 10 • 9 • 8 and 
24 • 23 • 22 ■ 21 • 20 ■ 48 respectively. 

(9) For every integer n > 5, the alternating group An is an (n — 2)-transitive 
but not {n — \) -transitive permutation group of degree n and order n\/2. 

(10) For every integer n>3, the symmetric group Sn is an n-transitive but not 
{n + I) -transitive perm,utation group of degree n and order n\. 

The above formulation of CTT obviously subsumes CQT, CFT, and CST. In 
turn the CTT is of course subsumed under the CDT which is given by Cameron [C] 

^^In other words, is the image of Ar under some injective group homomorphism Aj — » 
As = GL(4, 2) C Sis, and A7 is a 2- transitive but not 3-transitive permutation group of degree 
15. 

As permutation groups, A7 and 2^ ■ A7 are independent of the injective group homomorphism 
A7 — ► As we choose for defining A7. 
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and Kantor [K2] and which, in addition to heavily using the CT, is based on the 
previous work of Curtis-Kantor-Seitz [CKS], O'Nan [O], and others. The following 
weaker version of CDT, dealing mainly with the degrees of 2-transitive permutation 
groups which are not 3-transitive, was communicated to me by Cameron. 

Weak CDT. Concerning the degrees of 2-transitive permutation groups we have 
the following. 

(1) For every integer m > 1 and every prime power q, each group between 
PSL{m,q) and PTL{m,q) is a 2-transitive permutation group of degree 
|7'(GF(g)")| = (g™ - l)/{q - 1). Out of these, only the groups listed in 
items (1) and (2) of CTT are 3-transitive. In case ofm>2, for each group 
between PSL(m,q') and PrL{m,q), by considering the action on "hyper- 
planes" in V{GF{q)"^), we get a second representation as a 2-transitive but 
not 3-transitive permutation group of degree (g™ — l)/{q — 1). 

(2) For every integer m > 2, the group Sp(2m, 2) has 2-transitive but not 3- 

transitive permutation representations of degrees 2^'""^ + 2"*~^ and 2^™"^ — 
2m- 1_ 

(3) For every prime power q = > 2 with prime p, each group between 
PSU(3, q) and its automorphism group Aut PSU(3, q) has a 2-transitive but 
not 3-transitive permutation representation of degree q^ + 1, and moreover 
PSU(3, q) is a normal subgroup of index [GCD(3, g'+l)]2/u in Aut PSU(3, q).^'^ 

(4) For every proper odd power q = 2^ of 2, each group between the Suzuki 
group Sz(g') and its automorphism group Aut Sz((7) has a 2-transitive but 
not 3-transitive permutation group of degree <^ + 1, and moreover Sz((j') is 
a normal subgroup of index /i in Aut Sz(q). 

(5) For every odd power q = 3^ of 3, each group between the Ree group R\{q) 
and its automorphism group Aut R\{q) has a representation as a 2-transitive 
but not 3-transitive permutation group of degree q^ + 1, and moreover Ri{q) 
is a normal subgroup of index fi in Aut Ri (g) . 

(6) The group PSL(2, 11) has two distinct 2-transitive but not 3-transitive per- 
mutation representations of degree 11.^^ 

(7) The alternating group has two distinct 2-transitive but not 3-transitive 
permutation representations of degree 15; both are equivalent to isomor- 
phisms Aj ^ Aj c S15. 

(8) The "Higman-Sims" sporadic group HS has two different 2-transitive but 
not 3-transitive permutation representations of degree 176. 

(9) The "third Conway" sporadic group C03 has a 2-transitive but not 3-transitive 
permutation representation of degree 276. 

The above items (1) to (9) contain a complete list of 2-transitive but not 3-transitive 
permutation groups having a nonabelian minimal normal subgroup. The degree of 
a 2-transitive but not 3-transitive permutation group G having an abelian minimal 
normal subgroup is necessarily some power of some prime p; the said minimal 

centerless group G, i.e., a group G whose center is the identity, may be identified with its 
inner automorphism group and hence may be regarded as a normal subgroup of its automorphism 
group Aut G. 

^^Two permutation representations G Sn and G — > 5„/ of a (finite) group G axe equivalent 
if they differ by an isomorphism Sn — > S„/ induced by a bijection between the underlying sets of 
S„ and S„i ; note that then automatically n = n'. Two representations are distinct if they are not 
equivalent. 
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normal subgroup is isomorphic to (Zp)"^ and a 1-point stabilizer of the group G 
itself is isomorphic to a subgroup of GL{m,p); moreover, in the case m = 1 we 
must have G = AGL(l,p), whereas, in the case m = a prime number and p = 2 
we m,ust have G = AGL{1, 2"^) orArL(l,2™), and finally, in the case m — 2 and 
p = 3 we must have G = AGL{1, 9) orArL(l,9) or AML(1, 9) or AGL(2, 3) or 
ASL(2,3), where we have put AML(1,9) = the 1-point stabilizer o/ PML(2, 9),36 
and ASL(2,3) = (GF(3))2 xi SL(2,3). 

In connection with the above statement, we note the following Theorem of Burn- 
side which is really the starting point of the classification of 2-transitive permutation 
groups. Although most modern proofs of this make use of Frobe- nius' Theorem 
(1901) [Fr],'^'' it is clear that Burnside's original proof did not, since it is already 
given as Theorem IX on page 192 of the first edition of [Bu] published in 1897. In 
the second edition of [Bu] published in 1911, it occurs as Theorem XIII on page 
202, and there Burnside gives two proofs of it, one using Frobenius and the other 
without.^* Thus Burnside gives an "elementary" proof of the following Theorem 
without using "character theory." '^^ 

Burnside's Theorem. A 2-transitive permutation group has a unique minimal 
normal subgroup. The said subgroup is either an elementary abelian group'^^ or a 
nonabelian simple group. 

As a consequence of CTT and Weak CDT, and in view of a simple numerical 
lemma, we have the following. 

Special CDT. Given any prim,e p and any positive integer ji, concerning 2- 
transitive but not i -transitive permutation groups of degree g + 1, where q = p^, we 
have the following. If n = 1 andp is a Mersenne prime, then PSL(2,p), AGL(l,p+ 
1) and ArL(l,p+ 1) are the only such groups. If fj, = 1 but p is not a Mersenne 
prime, then PSL(2,p) is the only such group, except that for p = 2 this group is 
"accidentally" 3-transitive because it coincides with S3. If > 1 then, in addi- 
tion to the relevant groups listed in items (1), (3), (4), (5) of Weak CDT, the only 
other such groups are the groups AGL(1,9), ArL(l,9), AML(1,9), AGL(2,3), 
and ASL(2, 3), which occur when {fJb,p) = (3, 2), and the group AGL(1, g'+ 1) which 
occurs when q-\-l is a Fermat prime.^^ 

Here is the said 



^^In other words, AML(1,9) is the 2-point stabilizer of the Mathieu group Mn. 

^'^For Frobenius' Theorem there is no "character free" proof. As examples of modern proofs 
of Burnside's Theorem which seem to use Frobenius' Theorem, see 12.4 on page 32 of Wielandt 
[Wi] and 7.12 on page 233 of volume III of Huppert-Blackburn [HE]. 

^*In Burnside's classical style of writing, Theorem x means Theorem x together with the 
discussion around it. In other words, although everything is proved, only some of the conclusions 
are called theorems. This "classical" style is quite different from the so called "Landau Style" of 
Satz-Beweis-Bcmcrkuiig. In the classical style, you first discuss things and then suddenly say that 
you have proved such and such; in other words, the proof precedes the statement of a theorem. 

^^And certainly without using CT! 

^''A group is elementary abelian if it is isomorphic to (Zp)"^ for some positive integer m and 
some prime p. 

^^A Fermat prime is a prime of the form 2** + 1 for some positive integer fi. It follows then 
that fj, must be a power of 2, because otherwise fj, = fi' /i" where fj,' is even and fi" > 1 is odd and 
this would give the nontrivial factorization 2'' + 1 = (2''' + l){2i^'(i^"-^) - 2''' (''"-2) -\ h 1). 
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Simple Numerical Lemma. Given any primes p and tt and any positive integers 
/U and u, such that + 1 = tt", we have the following. If p > 2, then ^ = 1 and 
TT = 2 and u = a prime number, and so p must be a Mersenne prime. If p = 2 and 
u> 1, then fj. = 3 and tt = 3 and u = 2. If p = 2 and u = 1, then fi = a power of 

2, and so n must be a Fermat prime. 

To see this, first suppose that p> 2; now p is odd and hence + 1 is even and 
hence tt = 2; since p is odd, we have p = 1(4) or p = 3(4) and hence p"^ = 1(4), and 
therefore if fi is even then 2" — p^^ + 1 = 2(4) and this would imply u = 1 which 
would be absurd; on the other hand, if fi is odd then 2"^ = p'^ + 1 = {p + l)(y»^~^ — 

p^~'^ -I 1-1) where the second parenthesis contains an odd number of odd terms 

and hence its value is odd, but since that value divides 2", it must be 1, and this 
gives p^ + 1 =p+l which implies /U = 1. Next suppose that p = 2 and u> 1; now tt 

must be odd, and if u is also odd then 2'' = tt" - 1 = (tt - l)(7r"-i + Tr"-^ H hi) 

where the scicond parenthesis consists of u positive odd terms and hence its value 
is an odd number > u and, since it divides 2^, it must be 1 which is absurd; on the 
other hand, if u is even then 2*^ = tt" - 1 = (77"/^ - l)(7r"/2 + 1) and hence Tr"/^ _ 1 
and 7r"/2 + 1 are both powers of 2 whose difference is 2 and therefore they must be 
equal to 2 and 4, and this gives /U = 3 and tt = 3 and u = 2. Finally, \i p = 2 and 
u = 1, then by the last footnote we see that must be a power of 2, and tt must 
be a Fermat prime. 

Here is another consequence of CDT. 

Uniqueness Theorem for Transitive Extensions. Any two transitive exten- 
sions of a transitive permutation group are isomorphic as permutation groups, with 

only one exception. "^"^ The exception is thai PSL{2, 7) and A rL(l, 8) have a com- 
mon 1-point stabilizer, note that both these are 2-transitive but not 3-transitive 
permutation groups of degree 8. 

As an immediate corollary of the above theorem we have the following. 

Uniqueness Theorem for Transitive Extensions of 2-Treinsitive Groups. Any 

two transitive extensions of a 2-transitive permutation group are isomorphic as per- 
mutation groups. In particular, the Mathieu groups Mn, M12, M22, M23, and M24 
are the unique transitive extensions o/PML(2, 9), Mn, PSL(3, 4), M22, and M23 re- 
spectively. 

To end this review of group theory, we note that by the rank of a transitive 
permutation group is meant the number of orbits of its 1-point stabilizer; the lengths 
of these orbits, excluding the obvious one point orbit, are called subdegrees of the 
group; so the number of subdegrees is one less than the rank, and the sum of 
the subdegrees is one less than the degree. Thus a 2-transitive group is simply a 
transitive group of rank 2. Now CT has also been used by Kantor-Liebler [KL], 
Liebeck [L], and others, to give CR3 = classification of transitive groups of rank 3, 
which although much longer than CDT, should be quite useful for Galois theory. 
Here is an amusing sample from CR3 which does not use CT and which can be 
found in Kantor [Kl]. 

*^If we don't assume the given group to be transitive, then there are numerous exceptions. 
For example every finite group, in its standard representation as a regular permutation group, 
is a transitive extension of the identity group. Since for increasing I, there are fewer and fewer 
i-transitive permutation groups, it follows that "most" transitive permutation groups have no 
transitive extensions. 
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Sample from CR3. For any integer n > 1 and any prime power q, the groups 
PSp(2n, g) and 0{2n + are the only transitive permutation groups of rank 3 
whose subdegrees are — 1) a.ndq'^"'~^. A rank 3 transitive permutation 

group G with subdegrees q{q + 1)^ and q'^ for a prime power q > 1, is a subgroup of 
Aut PSL(4,g); moreover, if q>2 then G contains PSL(4, g). 

18. A TYPE OF DERIVATIVE 

To continue with the calculation of Galois groups, let me explain how to throw 
away a root a = ai of a polynomial 

n 

f = f{Y) = F" + aiF"-! + . . . + a„ = \{{Y - a,) 

i=l 

by using a type of derivative. Now the coefficients ai, . . . , a„ belong to a field K, 
and we want to find the polynomial 

/i = fi{Y) = = + + • • • + K-i e K{a)[Y\. 

To this end. first recall the three basic transformations of equations described in 
any old book. For instance, we may quote the following three relevant articles (= 
sections) from Burnside-Panton's 1904 book on the theory of equations [BP].^^ 

Art 31. To multiply the roots by a given quantity. For any u ^ 0, the 

polynomial g = g{Y) whose roots are u times the roots of / is given by 

g{Y) = u'^f = r" + ciF"-i + ... + Cn = 1[{Y - ua,) 

^ ^ i=l 

with Ci = u'ai. 

Art 32. To reciprocate the roots. In case aj 7^ for 1 < i < n, i.e., in case 
a„ ^ 0, the polynomial g = g{Y) whose roots are the reciprocals of the roots of / 
is given by 

9iY) = — / f ^) - - (1 + aiF + • • • + a„y") ^f[(Y - (- 

On \Y J an f-J^ V 

Art 33. To decrease the roots by a given quantity. For any u, the polynomial 
g = g{Y) whose roots are —u plus the roots of / is given by 

n 

g{Y) = f{Y + = y" + ciF"-! + . . . + c„ = - {ai - u)) 



For the last forty years I had happily assumed that this Burnside of the theory of equations 
[BP] was the same as the Burnside of the theory of groups of finite order [Bu] . To my dismay, at 
the Oxford Conference in April 1990, Peter Neumann told me that, although both were named 
William and both obtained a D.Sc. from Dublin around 1890, the equations Burnside was William 
Snow whereas the group theory Burnside was simply William. Strangely, I first learnt group theory 
from William Snow's book on the theory of equations. 
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with ci = tti + nu, . . . ,Cn = f(u). 

Now in the first and the third cases provided u <E K, and in the second case 
without any proviso, we have g{Y) g K[Y] and, assuming the roots ai, . . . , a„ to 
be pairwise distinct, we have Gal(g', K) = Gal(/, K) as permutation groups, and so 
for Galois theory purposes we may conveniently modify / by one or more of these 
three transformations. 

For example, sometimes it may be easier to compute the polynomial gi{Y) ob- 
tained by decreasing the roots of /i by a. In view of what we have just said, wc get 
Ga\{gi, K{a)) = Ga\{fi, K{a)) and hence, assuming / to be irreducible in 
for the one-point stabilizer Gi of G = Gal{f,K) we get Gi = Gal{gi,K{a)). 

Clearly gi can also be obtained by first decreasing the roots of / by a to get the 
polynomial g = g{Y) = f(Y + a), and then throwing away the root Y = of g; 
this gives gi{Y) = g{Y)/Y, and now remembering that /(a) = we get 

According to the calculus definition, by taking the "limit" of the RHS as Y tends 
to 0, we get f'{oi). This motivates the following definition according to which gi 
turns out to be the twisted Y -derivative of f at a. 

Definition. For any polynomial = 0{Y) in an indeterminate Y with coefficients 
in a field L and for any element /? in L, we call {6{Y + /3) — 6{f3))/Y the twisted 
Y -derivative of 9 at [3. 

For a moment let us denote the twisted ^-derivative of at /3 by 9' . Then clearly 
6' = 9'{Y) is a polynomial in Y with coefficients in i, and \{ 9 & L then 6' = 0, 
whereas: ii 9 ^ L then 6'' 7^ and the F-degree of 9' is 1 less than the F-degree of 
6, and the two polynomials 9 and 9' have the same leading coefficient, and hence 
in particular, if 9 is monic then so in 9'. 

Next we note that this is L-linear because for any 5 = 6{Y) G L[Y] and X,ij.gL 
we have 

However, the usual product rule is to be replaced by a twisted product rule because 
by the standard trick of adding and subtracting the same quantity we get 



m' 



d{Y + /3)9{Y + /3) - 5{Y + l3)9{(3) 



+ 



Y 

5{Y + mii) - mm 



=5*9' + 5' 9^ 

where 5* is the Y -translation oi 5 by (3 and 9^ is the evaluation of 9 at (3, i.e., 
S* = S*{Y) = d{Y + P) and 9^ = 9{f3). 

Finally, for any positive integer m we have the power rule 

(F™)' = F-"-! + mpY""-^ + ■■■+ M /S^y™-*-! + ■■■ + m/3'"-i 
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and, in case charL ^ 0, for any power q of charL we have the 'prime power rule 
(y?)' = and combining this with the product rule, we get the power product 
rule 

[v^eiY)]' = {Y + i3ye'{Y) + y-'^eip). 

The reason for explicitly mentioning Y in all this is that there may be other indeter- 

minatcs present; for instance, if ijj = iJj{X, Y) is a polynomial in indctcrminatcs X 
and Y, and (3 is an element in a field which contains X as well as all the coefficients 
of ■0, then the twisted y-derivative of '0 at /3 is given by {il){X,Y) — ip{X, l3))/Y. 

Reverting to the original situation by taking (3 = a and L = K{a) in the above 
set-up, wc conclude with the following. 

Summary about the twisted derivative. If / = f{Y) is a nonconstant monic 
irreducible polynomial in an indeterminate Y with coefficients in a field K such 
that / has no multiple root in any overfield of K , and if a is a root of / in some 
overfield of K, then by letting /' = f'{Y) to be the twisted y-derivative of / at 
a we have that the Galois group Ga.l{f' , K{a)) is the one-point stabilizer of the 
Galois group Gal(/, K). 

Now without assuming / to be irreducible and without any precondition about 
multiple roots, suppose the degree of / is n > 2 and suppose for every root a of 
/(y) in a splitting field of K wc have that the twisted y-derivativc of f{Y) at a 
is irreducible in i4r(a)[y], then f{Y) must be devoid of multiple roots; namely, if 
/(y) = nr=i(^ ~ "^1 ~ '^2 and f'{Y) is the twisted y-derivative of f{Y) 

at a = ai, then f'{Y) is reducible in _ft'(a)[y] because its degree is n — 1 > 1 and 
it has (y — 0:2) as a factor in K{a)[Y]. Thus we have the following. 

Twisted Derivative Criterion. // f{Y) is a nonconstant monic polynomial of 
degree > 2 in an indeterminate Y with coefficients in a field K such that for every 
root a of f(Y) in a splitting field of K we have that the twisted Y-derivative of f(Y) 
at a is irreducible in K{a)[Y], then f{Y) has no multiple roots in any overfield of 
K. 

19. Cycle lemma 

As another tool for calculating Galois groups, let us make note of a "cycle 
lemma" . 

Let K he a, field and consider a monic polynomial 

n 

f = f{Y) = y" + aiy"-i + . . . + a„ = JJ(y _ 

i=l 

of degree n in an indeterminate Y with coefficients ai, . . . , a„ in ii' having pairwise 
distinct roots ai, . . . , a„ in some overfield of K. Now by conveniently enlarging the 
said overfield and moving it by a if-isomorphism, it can be construed to contain 
any prcassigncd overfield K* of K, and this gives us the following obvious but basic 
principle of computational Galois theory. 

Basic Extension Principle. For any given overfield K* of K, the Galois group 
Gal(/, K*), as a permutation group of degree n, acting on the roots {ai, . . . , Q!„}, 
may be regarded as a subgroup of the Galois group Gal(/, K) . 
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Given any overfield K* of K and any factorization 

rn 

f{y) = n ^^^^^ ^^•(^) = + a^i^"^"' + • • • + ajnj 

with a^i, . . . , aj„j. in K* we can relabel the roots ai, . . . , as an, . . . , ai^m, • • • > 

ttml, • ■ • , Ctmn^ SO that 

= ej{Y) = - Uji) for 1 < j < m 

i=l 

and we can identify the direct product x • • • x 5'„^, where S'n^. is the symmetric 
group acting on aji , . . . , ajn^ , with a subgroup of the symmetric group Sn acting 
on ai, . . . , a„. As a second obvious but basic principle we then have the following. 

Basic Projection Principle. For the Galois group Gal{f,K*) c Sn we have 
Gal{f,K*) C Sni X ••• X Sn^, and for 1 < j < rn, the Galois group Gal(/, if*) 
maps onto the Galois group Gal{ej,K*) c Sn^ under the natural projection 3^ x 

• • • X — > Snj ■ 

Recall that a u- cycle is a permutation a, say in Sn, such that for some v dis- 
tinct elements ai-^, . . .,ai^ in {ai,...,a„} we have cr{ai-^) = ai^, ■ ■ .,a{ai^_-^)= 
= ctii and a{aj) = aj for all j ^ {zi, . . . , i„}. Now if ei{Y) is irreducible 
in and if either ni is prime or Gal(ei, K*) is cyclic, then clearly Gal(ei, K*) 

contains an ni-cyclc ti, and by the Projection Principle ti is the projection of some 
r € Gal{f,K*), and if also | Gal(ej, -ftT*)] and rii are coprime for 2 < j < m then 
upon letting fj, to be the product of | Gal(e2, K*)\, . . . , \ Gal(e„, K*)\ we see that 
r'' G Ga\{ f,K*) is an m-cycle. Therefore in view of the Extension Principle we 
get the following. 

Cycle Prelemma. If \Gal{ej,K*)\ and ni are coprime for 2 < j < m, and 
ei{Y) is irreducible in K*[Y], and either rii is prime or Gal{ei,K*) is cyclic, then 
Gal{f,K) contains an ni-cycle. 

To convert the Cycle Prelemma into the Cycle Lemma, let f be a (real discrete) 
valuation'^'^ of K, i.e., u is a map of K onto the set of all integers together with 
the symbol oo such that for all a,b in K we have v{a) = oo 4^ a = 0, and v{ab) = 
v{a) + v{b), and v{a + b) > min{v{a),v{b)). Recall that {a € K : v{a) > 0} is 
called the valuation ring of v, and this ring modulo the unique maximal ideal 
{a G K : v{a) > 0} in it is called the residue field of v. Also recall that v is said to 
be trivial on a subfield k of K, or v is said to be a valuation of K/k, if v{a) = for 
all 7^ a € A:. Let K he a finite algebraic field extension of K and let vi, . . . , w/j be 
the extensions of v to K, i.e., wi, . . . , t)/i are those valuations of K whose valuation 
rings intersected with K give the valuation ring of i>; we may also say that v splits 
in K into i)i , . . . , . By r{vj : v) wc denote the reduced ramification exponent^^ of 
Vj over V, i.e., f{vj : v) is the unique positive integer such that for all a G -ftT we 
have Vj{a) = f{vj : v)v{a). By d{vj : v) we denote the residue degree of Vj over v, 



In this paper, by a valuation we shall mean a real discrete valuation. 
Also called the reduced ramification index. 
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i.e., d{vj : v) is the field degree of the residue field of vj over the residue field of 
V. Note that if either K/K is separable, or v is trivial over a subfield k of K such 
that K/k is finitely generated of transcendence degree 1, then 

h 

(t) J2^{v,:v)d{vr.v) = [K:K]. 

j=i 

Also note that vj is unramified over v, or over K, means that f{vj : v) = 1 and 
the residue field of vj is separable over the residue field of v; vj is ramified over 
V, or over K , means that Vj is not unramified over v; v is unramified in K means 
that Vj is unramified over v for 1 < j < h; and finally, v is ramified in K means 
that V is not unramified in K. Now it is well known that if f{Y) is irreducible 
in -?sr[l^], K = K{ai), K* — the completion of K with respect to v, and ej(Y) is 
irreducible in if*[y] for I < j < m, then h = m and, after a suitable relabelling, 
f{vj : v)d{vj : v) = Uj for 1 < j < h; for instance see §2 of [A2]. Moreover, 
by Newton's Theorem, if the residue field of v is an algebraically closed field of 
the same characteristic as K, and if nj ^ O(chariir) for some j, then for that j 
the Galois group Gal{ej, K*) is cyclic; for a proof of Newton's Theorem based on 
Shreedharacharya's method of completing the square, see my new book on algebraic 
geometry for scientists and engineers [A6]. Therefore by the Cycle Prelemma we 
get the 

Cycle Lemma. If f{Y) is irreducible in K\Y] and there exists a valuation v of K 
such that the residue field of v is an algebraically closed field of the same character- 
istic as K and such thai for the extensions Vi,. . . ,Vh of v to a root field'^^ of f{Y) 
over K we have that f(vj : v) and f{vi : v) are coprime and f{vj : v) ^ O(chariir) 
for 1 < j < h, and either f{vi : v) is prime or f{vi : v) ^ 0(char/r), then the 
Galois group Gal{f,K) contains an f{vi : v)-cycle. 

The Basic Extension Principle can be refined thus. 

Refined Extension Principle. Given any field extensions K C K' C K* . by the 

Basic Extension Principle we may regard Gal(/, if*) < Gal(/, -ftT') < Gal(/, if) < 
Sn, and assuming K* to be a finite normal extension of K we have that Gal(/, K*)<1 
Gal{f,K) and the factor group Gal{f,K)/ Gal{f,K*) is a homomorphic image of 
Gal(/f*,/f).47 

To see this, let L = K{ai, . . . , a„) and L* = K*{ai, . . . , q;„). Now L is a (finite) 
Galois extension of K, and given any a G Gal(/, K), we view cr as a permutation of 
{1,2,..., n} such that for some (actually unique) r G Gal(L, K) we have T{ai) = 
Q;£r(j) for 1 < i < n. Likewise, L* is a (finite) Galois extension of K*, and given 
any cr* G Ga\{f,K*), we view a* as a permutation of {1,2, . . . ,n} such that for 
some (actually unique) r* G Ga\{L*,K*) we have T*{ai) = a„»(i-) for 1 < i < n. 
Obviously, a = a* t = t*\L where r* \L denotes the restriction of r* to L. Thus 



root field of f{Y) over K is a. field obtained by adjoining a root of f{Y) to K, for instance 
the field K{ai). 

^^Thatis, Ga.l{f,K)/Gal{f, K*) Gal(/f*, J!r)/Ar for some normal subgroup AT of Gal(i<:*, K). 
Note that for any finite normal extension K* of a field K, without assuming K* to be separable 
over K, the Galois group Gal(ft'*, K) is defined to be the group of all JC-automorphisms of K* . 
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we get the commutative diagram 

Gal{f,K*) > G&\{f,K) 

I I 

Gal{L*,K*) — ^ Gal{L,K) 

where the left arrow is the isomorphism a* ^ t* , the right arrow is the isomorphism 
cr I— > T, the top arrow is the inclusion Gal(/, K*) C Gal(/, K), and the bottom arrow 
S is the injection r* i— > t*\L. Therefore our assertion is equivalent to saying that 
imJ is a normal subgroup of Ga.\{L,K) and Gsl{L, K)/imS « Gal{K*,K)/N for 
some normal subgroup N of Gal(Jsr*, K). To prove this new version of the assertion, 
let Kq = Ln K*, let Kq be the maximal separable algebraic field extension of Kq 
in K*. let Lg = ^oidt ■ ■ ■ iCtn), and let us depict all this in the following Hasse 
diagram. 

L* 

L. •K* 

K 

Referring to the lower quadrilateral in the above diagram, Lq/Kq is a finite Galois 
extension, the field Lq is a compositum of the fields L and with LDKq = Kq, and 
the four sides of the said quadrilateral represent the finite Galois extensions L/Kq, 
Kq/Kq, Lq/L, and L^K^; hence by the Fundamental Theorem of Galois Theory, 
the group Gal(LQ, Kq) is the internal direct product of the two normal subgroups'*^ 
Ga\{LQ,L) and Ga\{LQ,KQ), and A* ^ X*\L gives a surjection Ga.l{LQ,Ko) —^ 
Gal{L,KQ) whose kernel is Gal{LQ,L) and whose restriction to Gal(-Lo>-^o) 
isomorphismGal(LQ, _R'q) — > Gal(L,_ft'o)- By applying the Fimdamental Theorem 
of Galois Theory to the left triangle in the above diagram, i.e., by noting that 
L/K, L/Kq, and Kq/K are Galois extensions, we see that Gs1{L,Kq) is a normal 
subgroup of Gal(i,fs:) and G&\{L,K) / Gal(L,KQ) « G&\{Ko,K). Upon letting 

Gal(L, A'o) — > Gal(iy, K) be the natural inclusion Gal(i, ifo) C Gal(_L, K), we see 
that the composition G8.\{Lq,Kq) — ^ Gal(L,ifo) G&\{L,K) coincides with 
the injection Gs\{L'^, K^) Gs\{L,K) given by A* ^ A*|L, and hence 

im5o = Gal(L, Kq) < Gal(L, K) 

and 

Gal{L,K)/\m5o w Gal{Ko,K). 



**That is, the intersection of the two normal subgroups is the identity and they generate the 
whole group. 
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By applying the Fundamental Theorem of Galois Theory to the right triangle in 
the above diagram, i.e., by noting that Kq/K, Kq/Kq, and Kq/K are finite Galois 
extensions, we see that 

Ga\{K^,Ko) < Ga\{K^,K) and Ga\{K^,K)/ Gal{K^,Ko) « Ga\{Ko, K). 

Therefore, upon letting A'' = Gb1{Kq, Kq), we conclude that 

im^o < Gal(L, K) and Gal(L, K)/\m5o w GaKi^o*, K)/N 

for some 

iV^GaKi^o*,^)- 

Finally, referring to the modified diagram obtained by deleting the three lines ema- 
nating from Kq in the above diagram, L /K and / K are finite Galois extensions; 
K* /K is a finite normal extension; K* / Kq is pure inseparable; Lq is a compositum 
of L and K^; and L* is a compositum of Lq and K* . Consequently A i-^ A|-K'o 
gives an isomorphism Ga\{K*,K) Ga\{KQ,K), whereas A' i— » X'\Lq gives an 
isomorphism 

Gal{L*,K*) Gal{L*,K*) 
such that (5o(im^') = im^; therefore by the next to last display we conclude that 

imS<GaliL,K) and Ga\{L, K)/imS Ga\{K* , K)/N 

for some A^<Gal(i^*,/i:). 

For applying to specific situations, here are some 

Corollaries of the Refined Extension Principle. Given any finite algebraic 
field extension K' of K , by the Basic Extension Principle we may regard Gal(/, K') < 
Gal(/, K) < Sn, and then upon letting K* be a least normal extension of K con- 
taining K' , we have the following. 

(1.1) There exists N <iGal{K* , K) and M <Gal{f, K) with M < Gal{f, K') such 
that Ga\{f,K)/M « Ga\{K\K)/N. 

(1.2) // Ga\{K*,K) is solvable, then there exists M < Ga\{f,K) with M < 
Gal(/, K') such that Gal(/, K) /M is solvable. 

(1.3) // Gal{K*,K) is solvable, and Ga\{f,K) = Sn, and 3 ^ n 4, then 
Ga\{f,K') = Sn or An. 

(1.4) IfGal{K*,K) is cyclic and Giil{f,K) = S„, then Ga\{f,K') = Sn or A„. 

(1.5) IfGa\{K*,K) is cyclic of odd order and Ga\{f,K) = Sn, thenGa\{f , K') = 

Sn- 

(1.6) If there is no nonidentity group which is a homomorphic image ofGal{f, K) 
as well as Ga\{K*,K), then Ga\{f,K') = Gal{f,K). 

(1.7) // Gal(/, ii") is a simple group which is not a homomorphic image of 
Ga\{K*,K), then Gal(/, K') = Gal(/, K). 

(1.8) IfGa\{K*,K) is solvable and Gal(/, K) is nonabelian simple, then Gal(/, K') = 
Ga\{f,K). 

(1.9) IfGii\{K*,K) is solvable, and Ga\{f,K) = An, and 3 ^ n ^ A, then 

Ga\{f,K') = An. 

(1.10) IfGa\{K\K) is cyclic, andGa\{f,K) = A„, andn = 4, thenGal{f,K') = 
An or (^2)2. 
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(1.11) IfGal{K*,K) is cyclic, andGal{f,K) = A„,andn = 3, thenGal{f,K') = 

An or Zi . 

(1.12) If Ga.\{K* , K) is cyclic of order nondivisible by 3 and Gal{f,K) = An, 
then G-Al{f,K') = A„. 

(1.13) If Giil{K* , K) is solvable and Gal{f,K) = PSL(2,g) for a prime power 
q>S, then Gal(/, K') = PSL(2, q). 

(1.14) IfGal{K*,K) is cyclic of order nondivisible by charfc and Gal{f,K) = 
PSL(2, q) with l<q = n-l= a power of char k, then Gal(/, K') = PSL(2, q). 

(1.1) follows by taking M — Gal(/, K*) in the Refined Extension Prineiple. The 
implication (1.1) =^ (1-2) follows from the fact that a homomorphic image of a 
finite solvable group is solvable. In view of (1.1), the implication (1.2) ^ (1.3) 
follows from the facts that if n > 5 then Sn is nonsolvablc, and An and Sn arc 
the only nonidentity normal subgroups of Sn, and there are no other subgroups of 
Sn between An and 5„, whereas if n < 2 then An and 5„ are the only subgroups 
of Sn- In view of (1.1), the implication (1.3) (1.4) follows from the fact that 
S3 and S4 are noncyclic; A3 is the only nonidentity normal subgroup of S3, for 
the factor group we have 53/^3 = Z2, and the only nonidentity normal subgroup 
of 5*4 other than A4 is the Klein group (^2)^ consisting of the four permutations 
(1), (12)(34), (13)(24), (14)(23), and the factor group of ^4 by the Klein group is 
isomorphic to 53. In view of (1.1), the implication (1.4) (1.5) follows from the 
fact that Sn/An = Z2 or Zi. The implications (1.1) => (1.6) => (1.7) arc obvious. 
The implication (1.7) =^ (1-8) follows from the fact that a homomorphic image of 
a finite solvable group is solvable. In view of (1.1), the implication (1.8) =J> (1.9) 
follows from the facts that if n > 5 then A„ is nonabclian simple, whereas if n < 2 
then An = Z\. The implication (1.1) =^ (1-10) follows from the facts that A4 is not 
cyclic, and the Klein group {Z^f' is the only nonidentity normal subgroup of A4, and 
the factor group by the Klein group is Z3. The implication (1.1) => (1.11) follows 
from the fact that A3 = Z3. In view of (1.1), (1.10), and (1.11), the implication 
(1.9) ^ (1.12) follows from the fact that A^jiZ^f = Z3 = A3. The implication 
(1.8) => (1.13) follows from the fact that PSL(2,g) is nonabclian simple for every 
prime power g > 3. Finally, in view of (1.1), and what we have said about S3 and 
A4, the implication (1.13) (1.14) follows from the facts that PSL(2, 2) = ^3 and 
PSL(2,3) = A4. 

As a consequence of the Refined Extension Principle we have the 

Substitutional Principle. Assume that K = the field k{X) of rational functions 
in an indeterminate X with coefficients in a field k, i.e., ai = ai{X) G k{X) for 
1 <i < n. Given any x{^) G k{X) \ k, let 

fx = fxiY) = + a,{x{X))Y-' +■■■+ anixiX)) G k{X)[Y] = K[Y] 

and let K' = k{V) where V is an indeterminate. Then the k-hom,omorphism 
X x(^) gives an embedding K = k{X) C k{V) = K' , and by sending Y to 
Y it gives an embedding K[Y] = k{X)[Y] c fc(y)[F] = and this sends f to 

f^ with X changed to V . Therefore, f^ has no multiple roots in any fi,eld exten- 
sion of k{X) and, upon letting K* = a least normal extension of k{X) containing 
k{y), by the Basic Extension Principle we may regard Gal(/, K*) < Gal(/, k{V)) = 
Gal{f^,k{X)) < Gal{f,k{X)) < Sn, and now by the Refined Extension Principle 
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Ga\{f,K*) <iGsi\{f,k{X)) and Ga\{f,k{X))/Gal{f,K*) « Gal{K*,k{X))/N for 
some N <iGa\{K*,k{X)).'^^ 

Here are the corresponding 

Corollaries of the Substitutional Principle. Letting the situation be as in the 
Substitutional Principle, and remembering that K* = a finite normal extension of 
k{X) and Gal{f^,k{X)) < Gal{f,k{X)) < S„, we have the following. 

(2.1) There exists N<iGal{K* , k{X)) and M<iGal{f,k{X)) with M < Ga\{f^,k{X)) 
such that G&\{f,k{X))/M « Ga\{K\k{X))/N . 

(2.2) If G&\{K* ,k{X)) is solvable, then there exists M <Ga\{f,k{X)) with M < 
Gal(/x,fc(X)) such that Ga\{f,k{X))/M is solvable. 

(2.3) // Ga\{K*,k{X)) is solvable, and Gal(/, kiX)) = Sn, andS^nj^A, then 
Gal(/x,fc(X)) = 5„ orA„. 

(2.4) IfGa\{K*,k{X)) is cyclic and Gal(/, k{X)) = Sn, then Ga\{f^, k{X)) = Sn 
or An. 

(2.5) // Ga\{K* ,k{X)) is cyclic of odd order and Ga\{f,k{X)) = Sn, then 
Ga\{f^,k{X)) = Sn. 

(2.6) If there is no nonidentity group which is a homomorphic image ofGa\{f, k{X)) 
as well as Ga\{K*,k{X)), then Gal{f^,k{X)) = Gal(/, fc(X)). 

(2.7) If Ga\{ f,k{X)) is a simple group which is not a homomorphic image of 
GaliK*, kiX)), then Gal{f^,k{X)) = Gal{f,k{X)). 

(2.8) If G;il{K* , k{X)) is solvable and G&1{ f,k{X)) is nonabelian simple, then 
Ga\{f^, k{X)) = Gal(/, k{X)). 

(2.9) IfGal{K*,k{X)) is solvable, and Gal(/, k{X)) = An, andS^n^A, then 
Gal(/x,fc(X))=A„. 

(2.10) If Ga\{K* , k{X)) is cyclic, and Ga\{f,k{X)) = An, and n = 4, then 
Galif^,kiX)) = An or {Z^f . 

(2.11) If Ga\{K*,k{X)) is cyclic, and Gal(/,fc(X)) = An, and n = 3, then 
Ga\{f^,k{X))=An orZ^. 

(2.12) If Gal{K* , k(X)) is cyclic of order nondivisible by 3 and Gal{f,k{X)) = 
An, then Galif^,k{X)) = An- 

(2.13) If Ga\{K*,k{X)) is solvable and Gal{f,k{X)) = PSL(2,g) for a prime 
power q>3, then Gal{f^, k{X)) = PSL(2, q). 

(2.14) If Ga\{K* , k{X)) is cyclic of order nondivisible by char k andGal{ f,k{X)) = 
PSL(2,g) withl<q = n-l = a power of char k , thenGalif^,k{X)) =PSL{2,q). 

The proof of (2.1) to (2.14) foUows from the above proof of (1.1) to (1.14) by 
changing "Refined Extension Principle" to "Substitutional Principle" and by chang- 
ing (l.j) to (2.i) for 1 < i < 14. 

Stated in a form more suitable for applying to the specific equations described 
earlier, here are some further 

Corollaries of the Substitutional Principle. Letting the situation be as in the 

Substitutional Principle, and remembering that Ga[{f-^,k{X)) < Gal(/, fc(X)) < 
Sn, and assuming that k is algebraically closed and x{-^) = cX'' with ^ c € k 
and nonzero integer r, we have the following. 

''^Note that by writing x{^) = x'(^)/x"(^)i where x'{^) E'ld x"{^) ^^e coprime nonzero 
polynomials in X with coefficients in k, we have K* = a splitting field of x'O^) ~ ^x"0^) over 
k{X), and if the said splitting field coincides with a root field of x'(^) ~ ^x"{Y) over k{X), then 
we have Gal(/, K*) = Gal(/, k{V)) = Gal(/x, fc(X)). 



GALOIS THEORY ON THE LINE IN NONZERO CHARACTERISTIC 



33 



(3.1) There exists M<iGa.l{f,k{X)) wtth M < Gal{f^.k{X)) such that Gal{f,k{X))/M 
is a cyclic group whose order is nondivisihle by charfc but divides r. 

(3.2) //Gal(/, k{X)) = Sn, then Gal(/x, k{X)) = Sn or A„. 

(3.3) //Gal(/,fc(X)) = S„ andchark = 2, then Gal(/x, A:(X)) = S„. 

(3.4) // Gal(/, fc(X)) is nonabelian simple, then Gal{fy(^, k{X)) = 
Gal(/,fc(X)). 

(3.5) //Gal(/, k{X)) = A„ and 3 ^ n ^ 4, then Gal(/x, k{X)) = A„. 

(3.6) //Gal(/, k{X)) = An and n = 4, then Gal(/x, k{X)) = A„ or (Za)^. 

(3.7) //Gal(/, k{X)) = An and n = 3, then Gallf^, fc(X)) = An or Zi. 

(3.8) IfGal{f,k{X)) =An a«rf charfc = 3, then Ga\{f^,k{X)) = A„. 

(3.9) // Gal(/, k{X)) = PSL(2, q) for a prime power q > 3, thenG&\{f^, k{X)) = 
PSL(2,g). 

(3.10) // Gal(/, k{X)) = PSL(2, q) with l<q = n-l = a power o/char fc, then 
Gal(/^,fc(X)) = PSL(2,g). 

Namely, Gal(if*, if), i.e., the Galois group of the splitting field of yl''' - (f )''^''^' 
over k{X) is a cyclic group whose order is nondivisihle by charfc; but divides r, and 
hence (3.1), (3.2), (3.3) follow from (2.1), (2.4), (2.5) respectively, and (3.i) follows 
from (2.Z + 4) for 4 < i < 10. 

Here arc still some more 

Corollaries of the Substitutional Principle. Letting the situation be as in the 
Substitutional Principle, and remembering that Gal(/^,fc(X)) < Gal(/, fc;(X)) < 
Sn, and assuming that 

x{X) = Xd{Xd-i{---{xi{X)) ■■■)), 

where d is a positive integer and 

Xi{X) = ^^ek{X)\k 

Xi K^) 

with coprime nonzero members Xii^) and x'/iX) of k[X] for 1 < i < d, and upon 
letting K* to be a splitting field of x'iO^) — ^Xi'(^) ^^^^^ ^(^) /o*" 1 < i < d, we 
have the following. 

(4.1) If for each i with I < i < d we have that there is no nonidentity group 
which is a homomorphic image of Gal{f,k{X)) as well as Gal{K*,k{X)), then 
Gal(/^,fc(X)) = Gal(/,fc(X)). 

(4.2) // Gal(/, fc(^)) is a simple group which is not a homomorphic image of 
Gal{K*,k{X)) for any i with l<i<d, then Gal{f^,k{X)) = Gal{f,k{X)). 

(4.3) If Gal{K* , k{X)) is solvable for each i with 1 < i < d, andGal{f,k{X)) is 
nonabelian simple, then Gal(/^,fc(X)) = Gal(/, fc(X)). 

(4.4) If Ga\{K*,k{X)) is solvable for each i with 1 < i < d, andGai{f,k{X)) = 
An, and 3 ^ n ^ 4, then Gal(/x, k{X)) = An- 

Namely, (4.1), (4.2), (4.3), and (4.4) follow by repeatedly applying (2.6), (2.7), 
(2.8), and (2.9) respectively. 

Although we shall not use it in this paper, here is a third basic principle of 
computational Galois theory; for a proof see §61 of volume I of van der Waerden's 
book [V], and for some applications see the 1958 follow-up [A4] of my 1957 paper. 
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Basic Homomorphism Principle. If f{Y) G -Rt,[l^] where Ry is the valuation 
ring of a valuation v of K and if f{Y) has no multiple roots in any overfield of 
K, where K is the residue field of v and f{Y) e KiY) is obtained by applying the 
canonical epimorphism Ry K to the coefficients of f(Y), then, as a permutation 
group, the Galois group Gal(/, K) may be regarded as a subgroup of the Galois 
group Gal(/, K). 

To give some concrete examples of valuations, assume that K = k{x) where x is 
a transcendental over a field k. Then for every nonconstant irreducible (l){x) G k[x] 
we get a valuation of k{x)/k by taking 

/ 7r(x)(/)(x)' 

for any integer I and any nonzero 7r(.x) and e{x) in fc[x] which arc nondivisible by 
4>{x); we may call this the (j){x) = valuation of k{x)/k; if = x — c with c € k 
then we may also call it the x = c valuation of k{x)/k; note that for any other 
nonconstant irreducible (f)*{x) S k[x] we have = w^* <^ (p and (j)* are constant 
multiples of each other. In addition to these valuations, there is exactly one more 
valuation Voo of k{x)/k given by taking 



'k{x) 

Kx) 



dege(a;) — deg7r(a;) 



for all nonzero 7r(a;) and e{x) in k[x]; we may call this the x = oo valuation of 
k{x)/k. 

Let A > > be integers, and consider the polynomial S^{Z)Y^ + ri{Z)Y'^ in 
indeterminates Y and Z where S,{Z) and ri{Z) are nonzero coprime polynomials in 
Z with coefficients in k. Let y be an element in an overfield of K = k{x) such that 
^{x)y^ + rj{x)y^^ — 0. Now, upon letting vi,. . . ,Vh be the extensions oi v = Vij, to 
K = K{y), for 1 < J < /i we have 



r{vj : v)v 



r{vj : v)v{^{x)r]{x)) 
and hence 

f{vj : v) = 
and therefore by (f) we get 

(tt) [K:K]=0 



X 



ax) 



X — fj, 



GCB{X-^i,vi^{x)r,{x))) 
X — fj, 



GCB{X-^i,vi^{xUx))) 
Prom this we deduce the following 

First Irreducibility Lemma. Let A > /x > be integers, and consider the poly- 
nomial £_{Z)Y'^ + r]{Z)Y'^ in indeterminates Y and Z where £,{Z) and r]{Z) are 
nonzero coprime polynomials in Z with coefficients in a field k. Let y be an ele- 
ment in an overfield of k{Z) such that ^{Z)y^ + ri{Z)y^ = 0. Assume that there 
exists a finite number of nonconstant irreducible polynomials (j}i{Z), . . . , <f)m{Z) in 
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Z with coefficients in k such that, upon letting Vi be the largest integer for which 
4>i{Zy^ divides ^{Z)ri{Z) in k[Z], we have GCD(A — /x, J^i, . . . , = 1- Then 
[k{y,Z) : k{Z)] = \ — ji, and the polynomial £^{Z)Y^ + rj{Z)Y'^ is irreducible in 
k{Y)[Z].^° 

Namely, by (ff) we see that 
and clearly 

= TCM ( _^^J^_\ 
GCD(A - ^^1, • • • , vm) VgCD(A - i^i) ' ■ ■ ■ ' GCD(A - /x, v^) ) 

and hence 

[.to.Z):M^)1.0( oCB(,_\X. ....„) )■ 
Since GCD(A — ii,y\, . . . , Vm) = 1, we get 

[k{y,Z):k{Z)]=0{\-ii) 

and since y^~^ = —r]{Z)/^{Z), we conclude that 

[%,Z):fc(Z)] = A-M 

and the polynomial Y^~i^ + ri{Z)/S^{Z) is irreducible in fc(Z)[y]. Therefore by 
the Gauss Lemma, ■''-'^ the polynomial £_{Z)Y^~'^ + rj{Z) is irreducible in fc[Y, Z], and 
hence again by the Gauss Lemma, the polynomial ^(Z)y'^~^ + r;(^) is irreducible in 
k{Y)[Z], and therefore the polynomial ^{Z)Y^ + r]{Z)Yi^ is irreducible in k{Y)[Z]. 
Let us now convert the above lemma into the following 

Second Irreducibility Lemma. Let A > /i > 0. a,nd let £^x(Y, Z) and rj^{Y,Z) 
be nonzero homogeneous polynomials of respective degrees A and fi in (Y, Z) with 
coefficients in a field k. Assume that the polynomials ^\{Y,Z) and r}^^{Y,Z) are 
regular in Z^'^ and the polynomials and rjf^{l,Z) have no nonconstant 

common factor in k\Z\. Also assume that there exists a finite number of nonconstant 
irreducible polynomials 4'i{Z), . . . , (j)m{Z) in Z with coefficients in k such that, upon 
letting Vi to be the largest integer for vjhich (hi{ZY^ divides Z)ry^(l, Z) in k[Z], 
we have GCD(A — iJ,,i^i, . . . ,Pm) = 1- Then the polynomial S,\{Y, Z) + r]n{Y, Z) is 
irreducible in k(Y)[Z]. 

To prove this, note that Z i— > YZ gives a A;(y)-automorphism of which 
sends the polynomial £,x{Y, Z)+r]^,{Y, Z) to the polynomial ^a(1, Z)Y^+r]^^{l, Z)Y^'. 



80That is, ?(Z)y^ + 77(Z)y'' is either a nonzero element of k{y) or a nonconstant irreducible 
polynomial in Z with coefficients in k{y). 

^^The Gauss Lemma says that a nonzero polynomial in indeterminates Y and Z with coef- 
ficients in a field k is irreducible in k[Y,Z] if and only if it is irreducible in k{Z)[Y] and, as a 
polynomial in Y, its coefficients have no nonconstant common factor in k[Z]. 

^^That is, their degrees in Z coincide with their degrees in {Y,Z). 
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By the First Irreducibility Lemma, the second polynomial is irreducible in A;(y)[Z] 
and hence so is the first. 

As an illustration, let n > f > 1 be integers such that GCD{n,t) = 1. Now if 
u is any clement in an algebraic closure fc of fc such that u" = 1 = m*, then, since 
GCD(n, t) = 1, we must have u = 1; therefore 1 is the only common root of — 1 
and Z* — 1 in k. Again since GCD{n,t) = 1, either n or t is nondivisible by the 
characteristic of k, and hence either — 1 or — 1 is devoid of multiple roots 
in k. Therefore the polynomials (Z" - 1)/{Z - 1) and (Z* - 1)/{Z - 1) have no 
common root in k and at least one of them has no multiple root in k, and hence 
they have no nonconstant common factor in k[Z] and at least one of them has no 
nonconstant multiple irreducible factor in k[Z]; since n > t > 1, we conclude that 
their product has at least one nonconstant nonmultiple irreducible factor in 
By applying the fc-automorphism Z i-^ Z + 1 of k[Z] and by multiplying the second 
polynomial by —a, where a is any nonzero element of k, we see that the polynomials 
{Z + 1)" — l)/Z and —a[{Z + 1)* — 1]/Z have no nonconstant common factor in 
k[Z] and their product has at least one nonconstant nonmultiple irreducible factor 
in k[Z]. Consequently by taking {Z + Y)" - Y"/Z and -a[{Z + ¥)* - Y*]/Z for 
£,x{Y, Z) and Z), and by letting (/)i(Z), . . . , <t>m{Z) to be the distinct monic 

nonconstant irreducible factors of Ca(i,z)'7a'(1: ^) in k[Z\, and by letting Vi to be 
the largest integer for which 4ii{ZY^ divides Z)rif^{l, Z) in k[Z\, we see that 
i\(Y, Z) and rj^{Y, Z) in k[Z] are nonzero homogeneous polynomials of respective 
degrees A = n — 1 and /i = t — 1 in [Y. Z) with coefficients in k such that the 
polynomials £,\{1,Z) and 77^(1, have no nonconstant common factor in k[Z], and 
for some i we have Vi = 1, and hence trivially we have GCD(A — /x, i/i, . . . , Vm) = 1- 
Therefore by the Second Irreducibility Lemma we get the following 

Third Irreducibility Lemma. Let n > t > 1 he integers such that GCD(n.f) = 
1. Let Y and Z be indeterminates over a field k, and let ^ a € k. Then the 
polynomial {{Z + F)" - F^/Z - a[{Z + Yf - Y^]/Z is irreducible in k{Y)[Z]. 

20. The tilde polynomial and borrowing cycles 
Let us now apply the twisted derivative method to the polynomial 

Fn,s = KAx, F) = y" - ay* + X' 

with 1 < t < n and GCD(n,t) = 1, where s is a positive integer, a is a nonzero 
element in a field k of characteristic p,^^ and X and Y are indeterminates over k. 
We want to calculate the Galois group Gn,s = Gal(i^„,s, k{X)).^^ 
Let k be an algebraic closure of k, and let 

(i) d = — — - where e = < , , ' 

e { max p** with n-t = 0{p^') ifpj^O. 

^■^That is, there exists a nonconstant irreducible member of k[Z] which divides the said product 
but whose square does not divide the said product. 

^■'Here p may or may not be zero. Later on we shall specialize to the case of p ^ 0; with the 
further assumption that n s 0(p) and s = 0(t), the polynomial Fn,s reduces to the polynomial 
Fn considered in §11. 

^^In a moment we shall show that Fn.s has no multiple roots in any overfield of k{X), and 
hence the Galois group Gal(F„_s, A;(X)) is defined. 
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Now 

d 

(ii) F" - aF* = y* YliY - 

i=l 

with pairwise distinct nonzero elements in k and GCD(e,t) = 1, and 

hence by the First Irreducibihty Lemma we see that Fn,8 is irreducible in 
and therefore Fn,s is irreducible in k{X)[Y].^^ 

Let us put X = X and let y he a root of F„,s in an overfield of k{x). Then 

(iii) X* = ay* - 

and hence y is transcendental over k. Let E{y, Z) be the polynomial in an in- 
determinate Z such that E{y,Y) is the twisted F-derivative of Fn,s{x,Y) at y. 
Then 

E(v Z) = + _ Ci[{Z + yY-y'] 

and hence by the Third Irreducibihty Lemma we see that E{y, Z) is irreducible in 
k{y)[Z]. 

For a moment suppose that s = 1 < t: then by (iii) we have x € k{y), and 
hence E{y,Z) is irreducible in k{x,y)[Z], and therefore by the Twisted Derivative 
Criterion we see that Fn,i{X,Y) has no multiple roots in any overfield of k{X), 
and so Gal(F„,i, fc(JY')) is defined; since E{y,Z) is irreducible in k{x,y)[Z], it also 
follows that Gai{Fn^i , k{X)) is 2-transitive. Thus, if s = 1 <t then Fn^i is devoid 
of multiple roots and Gn,i is ^-transitive. 

Reverting to general s but still assuming t > 1, since Fn^s{X,Y) = i^„_i(X*,F) 
and F„.i(X, F) has no multiple roots in any overfield of k{X), it follows that 
F„_s(X, F) has no multiple roots in any overfield of k{X). Thus, if t > 1 then 
Fn^s is devoid of multiple roots and hence Gn,s is defined. 

To give a direct proof of Fn,s being devoid of multiple roots, let us calculate its 
F-discriminant. 

So first recall that the F-discriminant of a monic polynomial / = /(F) of degree 
n > in F with coefficients in a field K is denoted by Discy(/) and is defined by 
putting 

Discy(/) — Rcsy (/, fy) = thc F-icsultant of / and fy, 

where fy is the (ordinary) F-derivative of /. Upon letting m be the degree of fy, 
we note that Resi-(/, fy) is the determinant of a certain n + mhy n + m matrix; 
also note that m = n — 1 <^ n ^ O(charif); finally note that if fy = 0, i.e., if 
chari^ = p ^ and / S K[YP], then we take Discr(/) = 0. For calculational 
purposes, upon letting / = Hi^iC^ ~ observe that 

n 

Biscy{f)=l[fy{ai) 



^®For s = 1 this also follows by noting that F„,i is monic of degree 1 in X. 



38 



S. S. ABHYANKAR 



and upon assuming fy^^O and upon letting fy = e]XiLi{y — Cj) we observe that 

m 

Discr(/) = (-l)""e"n/(^^)- 

Finally, upon letting 

Discyf/) = the modified y-discriminant of = TT (ai — aj)^ 

we note that^'' 

Disc;.(/) = (-l)"("-i)/2Discy(/). 
Let us record the following well-known 

Discriminant Criterion. // char K ^ 2 and f is devoid of multiple roots, then: 
Gal{f,K) c An ^ Discyif) is a square in K. 

Now for the (ordinary) F-derivative of Fn,s we have 

{Fn,s)r = nY''-^ - taY'-^ 

-taY^-'^^O ifn = 0(p), 

r^^*-'^^=l*(^-M7^0 ifn^O(p), 

with b,(Ji, . . . ,u)n-t in k such that nb'^~^ = ta and nr=i*(^ ~ ^i) ~ yn-t _ i and 
hence 

f(-l)"(*-i)(-ta)"(X^)*-i if n = 0{p), 

Ui=![X' - n-\n - t)ab*Lol] if n ^ 0(p), 
and therefore, because GCD(n — t,t) = 1, we get 

(-l)"*t"a"X«(*-i) ifn = 0(p), 

X^it-l) [^nj^s(n-t) _ _ t)"-tf*a"] if n # 0(p), 

and, observing that if n = 0{p) then in k we have n" = and — (n — t)'^~*t* = 
(— = (— we conclude that in both the cases we have 

Discy(F„,,) = - (n - f)"-*t*a"X"(*^i). 

Thus always Discy(_F„.s) ^ and hence Fn_s is devoid of multiple roots. 

If s = 1 and n — t ^ 0{p) then by (i), (ii), (iii) we see that the valuation 
a; = of k{x)/k splits in k{x,y) = k{y) into the n — t + 1 = d + \ valuations 
y = 0,y = Pi, . . . ,y = (3d with reduced ramification exponents t, 1, . . . , 1, and hence 
if also t ^ 0{p) then by the Cycle Lemma we can find a i-cycle in Gal(i^„,i, 
and therefore, since by the Basic Extension Principle we have Gal(i^„,i, < 
Ga\{FnA-k(X)), wc get a t-cycle in Gal(F„,i, Thus, if n - t ^ 0(p) and 

t ^ 0(p) then Gn,i contains a t-cycle. 

Let us note the following Corollary of an 1871 Theorem of Jordan [J2] ; a proof 
can also be found in 13.3 on page 34 of Wielandt [Wi], and in 4.4 on page 171 of 
Volume I of Huppert [HB]. 



Discy(F„,s) = 



(iv) Discy(i^„,s) 



^^On pages 82-87 of volume I of van der Waerden's book [V], it seems to be wrongly asserted 
that DisCy(/) = Discy(/). Some authors call DisCy(/) the discriminant of /. 

®*Note that if n and t are both odd then n — t + 1 and nt axe both odd, whereas if one out 
of n and t is odd and the other even then n — t + 1 and nt are both even, and finally, since 
GCD(n,t) = 1, the remaining possibility of n and t both being even cannot occur. 
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Jordan's Corollary. An and Sn are the only primitive permutation groups of 
degree n containing a 3-cycle. 

Let us also note the following two 1892 Theorems of Marggraff [Mar]; these are 
given as Theorems 13.5 and 13.8 on pages 35 and 38 of Wielandt [Wi] respectively.^^ 

Marggraff's First Theorem. An and Sn are the only primitive permutation 
groups of degree n having a {n — y)-point stabilizer, with 1 < v < n/2, which acts 
transitively on the remaining v symbols. 

Marggraff's Second Theorem. A primitive permutation group of degree n con- 
taining a u-cycle, with 1 < v <n, is {n — v + 1) -transitive. 

For a moment suppose that n — t ^ 0{p) and 1 < t ^ 0(p).^" Now, in view of 
what we have proved above, Gn,i is a 2-transitive permutation group containing a 
t-cycle and hence if t = 2 then obviously^^ Gn,i = <?„; ii t = 3 then by Jordan's 
CoroUary^^ we have Gn,i = An or 5„; iit<^ then by Marggraff's First Theorem 
we have Gn,i = An or S'„; if i < n — 4 then by Marggraff's Second Theorem and 
CTT we have G„_i = An oi Sn', ii t = n — 4 then automatically 12 7^ n ^ 24 and 
hence by Marggraff's Second Theorem and CTT we have G„,i = An or Sn', and if 
t = n — 3 then again automatically 12 7^ n ^ 24 and hence if also 11 7^ n 7^ 23, then 
by Marggraff's Second Theorem and CTT wo have G„.i = An or 5„. In all these 
cases, if t is even then we must have G„^i = Sn because An cannot have a cycle of 
even length, and if t is odd and p ^ 2 then, in view of the Discriminant Criterion, 
by (iv) we can unambiguously decide between An and Sn] in particular, if k is 
algebraically closed and n = 0{p) and p ^ 2 and t is odd then we get An because 
in that case Disc^(i^„,i) is a square in k{x). Again, in all these cases, assuming 
k to be algebraically closed, in view of Corollaries (3.2) to (3.8) of the Substitu- 
tional Principle, we see that (1) t is even Gn,i = Sn ^ Gn,s = An or Sn', (2) 
t is odd and n = A =^ Gn,i = An or Sn with n = 4 => Gn,s = (-^2)^ or An or Sn', 
(3) t is odd and n > 4 and p = 2 => Gn,i = or Sn with n > 4 => Gn,s = 
An or Sn', (4) t is odd and n = 4 and p ^ 2 and DisCy(F„.i) is a square in k{x) =^ 
Gn,i = -^n with n = 4 ^ Gn,s = (^'2)^ or An', (5) t is odd and n > 4 and p ^ 
2 and Discy(F„_i) is a square in k{x) ^ Gn,i = An with n > 4 Gn,s = ^nl and 
finally (6) t is odd and p ^ 2 and Disc^(F„,i) is not a square in k{x) Gn,i = 

Sn ^ Gn,s — An OT Sn. 

Now for a moment suppose that n = 0{p). Then t ^ 0{p) and hence by (iv) we 
see that x = and a; = 00 are the only valuations of k{x)/k which are possibly 
ramified in k{x,y). Moreover, if s = Q{t) then, in view of (i), (ii), (iii), either by 
direct reasoning we see that the valuation x = of k{x)/k is unramified in k{x, y), 
or alternatively, first, upon letting x* = x^ we see that the valuation a;* = of 
k{x*)/k splits in k{x*,y) = k{y) into several valuations out of which y = is 



®® Apparently, Marggrail's Second Theorem can also be found in [J2]; see Neumann [N]. 
^''Note that if n s 0(p) then automatically n — t ^ 0(j>) and t ^ 0(p) because by assumption 
GCD(n,t) = 1. 

®l A 2-cycle is simply a transposition, and a 2-transitive permutation group containing a trans- 
position must contain all transpositions and hence must be the symmetric group. 
®^A 2-transitive permutation group is automatically primitive. 
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the only valuation which is possibly ramified over k{x*) and for it the reduced 
ramification index is t and the residue degree is 1, and now upon letting 

X if p = 0, 

where s' = max with s = 0{p^) if p ^ 0, 

we see that x/**/* = x* and s/s' is a positive integer which is divisible by t but 
not divisible by p and hence by MRT^^ we see that the valuation x' = of k(x') is 
unramificd in k{x',y), and finally from this we deduce that the valuation .t = of 
k{x)/k is unramified in k{x,y).^'^ 

Thus, if n = 0{p) and s = 0(t), then a; = oo is the only valuation of k{x)/k which 
is possibly ramified in k{x,y), and hence^^ a; = c» is the only valuation of k{x)/k 
which is possibly ramified in a least Galois extension of k{x) containing k{x,y). 
It only remains to note that, by Result 4 on page 841 of the 1957 paper [A3], as 
a consequence of the genus formula, every member of the algebraic fundamental 
group of the afBne line over an algebraically closed field k of characteristic p ^ 
is a quasi p-group; therefore in our case, if Gn.s = An or Sn and p ^ 2 then we 
must have Gn,s = An, because clearly Sn is not a quasi p-group for any n > p > 2; 
likewise, if Gn,s = (■^2)'^ or An or Sn with n = 4 and p = 2, then we must have 
Gn,s = (-^2)^ or Sn because A4 is not a quasi 2-group. 

Let us put all this together in the following 

Summary about the tilde polynomial. Lot k be an algebraically closed field 
k of characteristic p ^ 0, and consider the polynomial Fn — — aY* + X" in 
indeterminates X and Y over fc, where a is a nonzero element of k, and n, s, t are 
positive integers with GCD(n, t) = 1 and t < n = 0(p) and s = 0(t). Then Fn 
gives an unramified covering of the affine line over k, and for the Galois group 
Gn = Gal(F„, k{X)) we have the following. 
(Ill) If 1 < t < 4 and p 7^ 2, then G„ = A„. 

(11.2) If 1 < i < n - 3 and p 7^ 2, then G„ = A„. 

(11.3) If 1< i = n - 3 and p 7^ 2 and llj^ p ^ 23, then G„ = A„. 

(11.4) Ifl<i<4<n and p = 2, then G„ = A„ or Sn- 

(11.5) Ifl<t<n-3 and p = 2, then G„ = An or Sn- 

Here CT was not used in the proofs of (11. 1) and (11.4). Moreover, the following 
special cases of (II. 2) and (11.5) were proved without using CT. 

(II.2*) If 1 < i < n/2 and p 7^ 2, then G„ = An- 
(II.5*) If 1 < i < n/2 and p = 2, then G„ = An or Sn- 

Note- In the next section we shall consider the unramified covering of the affine line 
given by the polynomial F„ = F" — XY* + 1 with n = p + t and t ^ 0(p) which 



S3MRT=Abhyankar's Lemma = pages 181-186 of [A5]. 

^''This last deduction follows from the easy to prove fact which says that if a valuation v of a 
field K is unramified in a finite separable algebraic field extension L of K then the unique extension 
of I) to a finite purely inseparable field extension K' of K is unramified in the compositum of L 
and K'. 

^^Say by Proposition 1 of [Al]. 

®®In view of the above discussion, without using CT we see that ifl<t<4 = n and p = 2, 
then Gn = (-^2)^ or S„, and using CT we see that ifl<t = n — 3 and G„ 7^ A„, then either 
n = p = 11 and Gn = Mn, or n = p = 23 and Gn = M23- 
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was introduced in §11; there the calculation of the Galois group of Fn will be based 
on the fact that it contains a p-cycle because the valuation X = oo splits into the 
valuations Y = and Y = oo with reduced ramification exponents t and p respec- 
tively.^^ In the present section we considered the unramified covering of the afRne 
Une given by the polynomial F„ = — oF* + with n = 0(p) and GCD(n, t) = 
1 and s = 0{t) which was also introduced in §11; contrary to the Galois group of 
the polynomial Fn, it is not easy to find any cycle in the Galois group of the poly- 
nomial F„ because now the valuation X = (X) has the valuation F = oo as the only 
extension and for it the reduced ramification exponent is n which is a multiple of 
p; so what we did was to "borrow" a t-cycle by going down to the subfield k{X'^) of 
k{X), i.e., by embedding the Galois group over k{X) as a subgroup of the Galois 
group over klX") where the latter does contain a i-cycle. This method of borrow- 
ing cycles can already be found in Hilbert's 1892 paper [H] where he "borrows" a 
transposition to embed An in Sn, thereby constructing An coverings of the rational 
number field Q. 

21. The bar polynomial 
Let us now turn to the polynomial 

Fn,q = y" - XF* + 1 

with n = q + t and positive integer t ^ 0{p) mentioned in §11. This is a polynomial 
in indeterminates X and Y with coefficients in a field k of characteristic pj^O, and 
g is a positive power of p. 

The (ordinary) F-derivative of Fn,q is given by 

iFn,gh = niY"~' - XY'-') 

and so we get 

— F — 

Fn,q = —{Fn,q)Y+l 

and hence for the F-discriminant of Fn,q we have 

Discy(i<' = n" = a nonzero element of k. 

Therefore this gives an unramified covering of the affine line over k. At any rate, 
Fn,q, as a polynomial in F, is devoid of multiple roots, and so we can talk about 
its Galois group over k{X). We are interested in calculating this Galois group 
G„,, = Gal(F„,g,fc(X)). 

Now Fn,q, as a polynomial in X, is linear and in it the coefficient of X has no 
common factor with the terms devoid of X. Therefore Fn,q is irreducible in k[Y] [X] 
and hence in fc(X)[F]. Consequently, Gn,q, as a permutation group of degree n, is 
transitive. 

Regarding Fn,q as a polynomial in F and reciprocating its roots we get the 
polynomial 

e = e(F) = F«+* - xYi + 1, 

^^Actually, by reciprocating the roots of F„ we shall get the polynomial — XY^ + 1 for 
which the valuation X = oo splits into the valuations Y = and Y = oo with reduced ramification 
exponents p and t respectively. 
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where we have put x — X. Let y be a root of this polynomial in some overfield of 
k{x). Then solving 6(j/) = for a; we get the equation x = y* + The q = p 
case of this equation was really the starting point of this paper and it was originally 
obtained by taking ft, = ci = 1 in Proposition 1 of the 1957 paper [A3]. The case of 
general q can also be obtained by taking h = q/p and c^/j, = 1 and q = for 1 < 
i < q/p in that Proposition. By the said Proposition, or directly by looking at 
the above equation, we see that (1) the simple transcendental extension k{y) of 
fc is a separable algebraic field extension of the simple transcendental extension 
k{x) of k with field degree [k{y) : k{x)] = q + t; (2) Voo : a; = oo is the only 
valuation of A;(a;)/fc which is ramified in k{y); (3) v^o splits in k{y) into the valuations 
Wo : y = and Woo : y = oo; and (4) for the residue degrees and reduced ramification 
exponents we have d{wo : Uoo) = 1 and f{wo ■ Voc) = q, and d{u)c,c : I'oo) = 1 and 
f{vjao ■ Voo) = t. Now Gn,q = Gal(F„_g, k{X)) = Gal(6, k{x)), and hence in view of 
(3) and (4), by the Cycle Lemma we see that if k is algebraically closed and q = p 
then Gn,p contains a p-cycle; by the Basic Extension Principle, as a permutation 
group, G&\{Fn,qjk{X)) is a subgroup of Gsi\{Fn,q,k{X)) where k is an algebraic 
closure of k; therefore, without assuming k to be algebraically closed, we have that 
if q— P then Gn,p contains a p-cycle. 

Let prime denote the twisted F-derivative at y. Then 



= [y«(F* - x)]' + (1)' (by linearity) 

= [y9(y* - a;)]' (because constant' = 0) 

= {Y + y)" (F* - x)' + F^"^ (?/* - x) (by power product rule) 

= {Y + y)«(F*)' + y«-^(y* - x) (because constant' = 0) 

= {Y + y)«(F*)' - y-«y«-^ (because x = y* + ?/-«) 



and hence by the definition of (Y*)' we get 



and therefore 



e'(y) = [{Y + + y' _ 



e'(o) = 7^ 0. 



Let A(y) be the polynomial obtained by reciprocating the roots of 0'{Y). Then 
A(F) is a monic polynomial of degree q + t — 1 in Y with coefficients in k{y) and 
we have 



A{Y) 



, e' - 



1 yi {w) [{y + yf-y'] _ / y-i \ 

y J I (i) Vty«+*-i ; v^9-i J 



IK^ + y ) \ ^ ty^g+t-i 



Let A{Z) be the polynomial obtained by multiplying the roots of A{Z) by ty. 
Then A{Z) is a monic polynomial of degree q + t — 1 in an indeterminate Z with 
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coefBcients in k{y) and we have 
A{Z) ={tyr+'-'A 



tyj 



z 
ty 



{tyf-h 



















(«)'] 



ty2q+t-l 



=[{z + ty] 

and hence 



(z + ty-z* f-'^z* 



t2 



i/q+t 



where j{Z) is the monic polynomial of degree t — 1 in Z with coefficients in the 
prime field kp G k given by 



j{Z) =t-'^[{z + tY - Z* 



t-i 

E 

i=0 



■t-1- 



j.t-2 



where ji = ^ kp with 70 = 1. Now 7(0) = i*-^ and 7(-t) = (-1)*+^^* 

and hence 7(0) 7^ 7^ li^t). 

Let z be a root of A{Z) in some overfield of k{y). Then solving A(z) = for y 
we get 



(*) 



{z + tY^{z) 



and so in particular we see that z is transcendental over k. Consider the valuation 
K : z = of k{z)/k and let ( be an extension of it to k{y,z). Since t ^ Q ^ 7(0)) 
by the above equation we see that 



(« + t)C(y) = C(y''+') = C 



{z + t)<i^{z) 



r(C : k)k 



{z + ty^iz) 



f{C : K)t. 



Since q + t and t are coprimc, we must have f(C : k) > q + 1. Therefore [k{y, z) : 
k{z)] > q + t and hence by the above equation for we get [k{y, z) : k{z)] = q + t. 

Since [k{y,z) : k{z)] = q + t, by the above equation for we see that the 
polynomial (z + t)'^7(z)y+* — f/'^'^ z* is irreducible in Since the nonzero 

polynomials {Z+t)'^^{Z) and t'^~'^Z* in Z with coefficients in k have no nonconstant 
common factor in k[Z], we conclude that the polynomial 

{z + tyj{z)Y''+* -f-'^z* 

is irreducible in k[Y, Z]. Consequently the polynomial {Z + f)'^^{Z)y'^^^ - t'^~'^Z* 
is irreducible in k{y)[Z], and hence the polynomial A.(Z) is irreducible in k{y)[Z].^^ 
Therefore the polynomials A{Y) and A{Y) are irreducible in k{y)[Y], and hence 



®*This also follows from the First Irreducibility Lemma. In fact, the above argument is a 
special case of the argument used in the proof of the said lemma. 
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Gal(A, k{y)) = Gal(A, k{y)) = a transitive permutation group of degree n — 1. Since 
this group is the one-point stabihzer of Gn,q = G&\{F n^q,k{x)) , we conclude that: 
Gn,q is 2-transitive. 

As we have noted above, if q = p then Gn,p contains a p-cycle, and hence by a 
1873 Theorem of Jordan wc sec that if also t > 2, then Gn,p = An or Sn- In Result 
4 on page 841 of the 1957 paper [A3] we have noted that, as a consequence of the 
genus formula, every member of the algebraic fundamental group of the afRne line 
over an algebraically closed ground field of characteristic p ^ is a quasi p-group. 
Also obviously, for n > 1 and p ^ 2, the symmetric group Sn is not a quasi p-group. 
Therefore if k is algebraically closed and q=p^2 and n= p + t with t ^0{p) and 
t > 2, then Gn.p = the alternating group An- 

Here is the 1873 Theorem of Jordan [J3] we spoke of; proofs can also be found in 
13.9 on page 39 of Wielandt [Wi] and in 3.7 on page 331 of volume III of Huppert- 
Blackburn [HB]. 

Jordan's Theorem. // a primitive permutation group G of degree n = p + t 
contains a p-cycle, where p is prime and t > 2, then G = An or Sn-^^ 

In the case p = 2, a p-cycle is simply a transposition, and obviously Sn is the 

only 2-transitive permutation group of degree n which contains a transposition.™ 
Therefore if q = p = 2 and n = p + t with t ^ 0{p) and t > 0, then Gn,p = the 
symmetric group Sn- 

To throw away a second root of Fn.q, or more precisely to throw away a root of 
A(Z), this time around let prime denote the twisted .Z-derivative at z. Then 

A'{Z) = [{Z + ty^iZ)]' - (by linearity) 

= {Z + z + tyj'{Z) + Zi-'^j{z) - [Zj (by power product rule) 

= {Z + z + tyi{Z) + Zi-^j{z) - j{z){z + tyz-'iZ']' (by (*)) 
= {Z + z + tyj{Z) + j{z)Z'^-' - ^{z){z + t^z-'pt-iiZ), 

where pj (Z) is the monic polynomial of degree j > in Z with coefficients in the 
prime field kpdk given by 

and 

t-2 

7(^) ^^lt-i-3Pj{Z). 
Let $(W^) be the monic polynomial of degree g -|- f — 2 in an indeterminate W 



^^A 2-transitive permutation group is automatically primitive. 
A 2-transitive permutation group which contains a transposition, must automatically contain 
all transpositions, and hence must be the symmetric group. 
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with coefficients in k{z) obtained by multiplying the roots of A'{W) by z. Then 

\ 91 r /w \ 1 

+ j{z)z*-'^ 



W 

— +z + t 
z 



9-1 



-^{z){z + tYz' 



<ly<l-t-'^ 



Z Pt-1 



and hence 

4>(VK) = {W + + tzY^iW) + 7(z)z*-iT4^'?-i - 7(z)(z + i)«z«-*-Vt-i(W^) 

where /5j (W^) is the monic polynomial of degree j >0 uiW with coefficients in k{z\ 
given by 



and 



7(W^)=^z*-2-i7t-2-i/5,(W^). 
j=o 

Note that if t = 1 then ^{W) = O/^ whereas if i > 1 then j{W) is a monic 
polynomial of degree t — 2 in W with coefficients in k[z], and hence, in particular, 
ift = 2then7(W) = l. 

For a moment suppose that t = I; then 'yiZ) is a monic polynomial of degree 
t — 1 = in Z and hence ^{Z) = 1 and therefore 7(2) = 1; also as noted above 
■y{W) = 0; finally pt-i{W) is a monic polynomial of degree t — 1 = in W and 
hence pt-i{W) = 1. Thus, 



(1*) 



r A{Z) = {Z + ly - y-'i-'^Z and 



Next, for a moment suppose that t = 2; then 

7(Z) = [(Z + 2f - Z'^]= Z +1 

and hence j{z) = z+1; also as noted above j{W) = 1 and clearly {W + z^ +tz)'' = 
+ (z + 2)92;9; finally /5t_i(VF) is the monic polynomial of degree 1 in W given 
by pt-i{W) = W + 2z^ and hence 

$(0) =(z + 2)9z« - (2 + l)(z + 2)«z9-3(2z2) 

=(z + 2yzi-\z -2z-2) = -{z + 2)«+iz«-^ 



Thus, 



(2*) 



t = 2 



f A{Z) = {Z + 2Y{Z + 1) - 29-2j/-9-2z2 
^{W) = W'' + {z + l)zW'^-^ -{z + l){z + 2yz'i-^W 



+ 2) 



1 Because in that case the summation in the above expression for 'y(VK) is empty. 
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In the case of general t, by looking at the equation (*), we get the following 
"ramification diagram" for the field extensions k{x) C k{y) C k{y,z), where the 
square bracketed numbers are the reduced ramification exponents. 



Woe ■■ y ■ 



oo[tl— 



■ (i : ViCz) = 0[1] for 1 < i < t* - 1 



■Ct' : =0[?] 



wo -.y = 0[q] - 



•Co 



0[t] 



■ Coo.j ; z = oo 

for 1 < i < t" 



") = 0[^ 



To explain the top half of the extreme right hand side in the above diagram, first 
note that, since t ^ 0(p), the Z-discriminant of {Z + t~^Y — t~*^ € k[Z] is a nonzero 
constant; now is a root of this monic polynomial and by reciprocating the roots 
of {{Z + t~^y — t~*/Z we get j{Z); therefore ^{Z) is free from multiple factors; 
since 7(— f) ^ 0, we conclude that jiZ) = ip\{Z)ip2{Z) ■ ■ -ipf-iiZ), where, upon 
letting ipf (Z) = Z + t, wc have that -ijji (Z), t]j2{Z), . . . , ipf {Z) arc pairwisc distinct 
nonconstant monic irreducible polynomials in Z with coefficients in k; let Kj be the 
valuation of k{z)/k given by ipi{z)\ since GCD(g + t,q) = 1, by (*) we see that Kj 
has a unique extension d to k(y, z), and Ci, C2, • • • , Ct* a-re exactly all the extensions 
of Woo to k{y, z), and we have f{Q : Ki) = q + 1 foY 1 < i < t* , f(Ci : Woc) = 1 for 
1 < i < t* - 1, and f(Ct* : Woo) = q- 

Turning to the bottom half, since GCD{q + t,t) = 1, by (*) wc sec that the 
valuation kq : z = oi k{z)/k has a unique extension Co to k{y,z); since t ^ 0(p), 
upon letting t' = GGD{q + t,q-l) we have Z*' - t"''^ = €i{Z)e2{Z) . . . et"{Z), 
where ei{Z), e2{Z), . . . , etn[Z) arc pairwisc distinct nonconstant monic irreducible 
polynomials in Z with coefficients in k; now by (*) we see that the valuation 
of k{z)/k has t" extensions Coo.i, Coo,2, • • • , Coo,t" to k{y,z) which 
are characterized by saying that Cx.j{cj{j/'^^'^^'' z}-'^'^^^!*- )) > for 1 < j < t" , and 
moreover Coi Coo.i, Coo, 2, • ■ • , Coo,t" are exactly all the extensions of wq to k{y, z), and 
we havef(Co : kq) = g + f(Co : wq) = t, r(Coo,i : Koc) = {q + t)/t' for 1 < j < t" , 
and f(Cooj : wo) = (g - l)/f' for l<j< t" . 

Finally note that, since Voo is the only valuation of k{x) which is ramified in 
k{y), no valuation of other than wq and Wqo, can be ramified in k{y, z), and 
no valuation of k{y, z), other than Ci- C2, • • • , Ct* , Co, Coo, 1, Coo,2, • • • , Coo.t", can be 
ramified in the splitting field of ^{W) over fc(t/, z). 

Now, as noted above, if t = 1 then ^{Z) = 1, whereas if t = 2 then j{Z) = Z + 1, 
and hence, in connection with the top half of the above diagram, we have that 



i = 1 ^ y' 



(1') 
whereas 
(2') t = 2^ 



9+1 



(^+1)9 



, and t* = 1 and ipi{z) = z + 1 



21-^z^ 



{z + 2)i{z + l) 



, t* = 2, ipi{z) = z + 1, and ip2{z) = z + 2. 
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Concerning the bottom half of the above diagram we note that 

/2//-) ^ ^ 2 ^ LCM(f(Co : wp), f(Coo,i : wp), . . . , f{Coo,t" : wp)) 

r{Co ■■ Wp) 

^ r £^ if 1(3), 

if 1(3). 

Recall that a permutation group is said to be semiregular if its stabilizer, at any 
point in the permuted set, is the identity; in our terminology this is equivalent to 
1-antitransitive. By analogy, let us say that a univariate monic polynomial, with 
coefficients in some field, is semiregular over that field if by adjoining any one of its 
roots we get the splitting field. Note that then, assuming the roots to be distinct, 
the polynomial is semiregular if and only if its Galois group is semiregular. 

For a moment suppose that t = 1 and k contains all the {q — l)th roots of 1. 
Then by (1*) wc sec that ^{W) is semiregular over k{y,z) and hence its Galois 
group is 1-antitransitive; since this group is the 2-point stabilizer of the 2-transitive 
permutation group Gn,q, we conclude that Gn,q is a (2,3) group. Now if p = 2 
then Ci{{z + l)«z9"2) = g(g + i) and GCD(g - 1, q{q + 1)) = 1, and therefore by 
(1*) we see that <1>(VK) is irreducible in k{y,z)[W] and its Galois group is a cyclic 
group of order q — 1, and hence in particular the said Galois group is a (1,1) group; 
consequently, Gn,q is a (3,3) group. On the other hand, if p ^ 2 then by (1') we 
have {z + l^z'^'^ = z'i-^y-^'i+^^ and hence by (1*) we get 

^{W) = (W(9-l)/2 _ _^(9-l)/2y-(g+l)/2)^p^(5-l)/2 ^ _^(q-l)/2y-(q+l)/2^ 

and also 



and 



GCD 



2 ' 2 ' ' 



and hence the above two factors of $(H^) are irreducible in fc(?/,2;)[W] and the 
Galois group of <I>(VF) over k{y,z) is a cyclic group of order Thus, if t = 1 

and k contains all the {q — l)th roots ofl, then Gn,q is a (2,3) group, and moreover 
if also p = 2 then Gn,q is a (3,3) group and its order is {q + l)q{q — I), whereas if 
p 7^ 2 then Gn,q is not a (3,3) group and its order is l2±lM£i:ii. 

Therefore by an obvious corollary of the Zassenhaus-Feit-Suzuki Theorem we see 
that ift = l and k contains all the {q — l)th roots of 1, then Gn,q = PSL(2,q') with 
the possible exception that for q = 7 we may have Gn,q = ArL(l,8). 

The said corollary may be formulated in the following manner. For deducing 
this corollary from the Zassenhaus-Feit-Suzuki Theorem, the only thing we need to 
check is that if the (degree, order) pair (2', 2'(2' - 1)1) of a Feit Group ArL(l, 2'), 
where I is a prime, equals {q + 1, (M+2M^ j then I = {2^ - 2)/2 = 2^'^ - 1 and by 
direct calculation we see that this is not possible for I = 2, but is possible for i =3, 

and is never possible for / > 4 because then 2'"^ -1 = 1 + 2 + 2^ H h 2'~^ > 

1 + 2(/ - 2) = / + (/ - 3) > Z. 
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Corollary of the Zassenhaus-Feit-Suzuki-Theorem. If p = 2, i.e., if q is a 
positive power of 2, then PSL(2, q) is the only (3, 3) group of degree q+l. If p ^ 2, 
i.e., ifq is a positive power of an odd prime p, then PSL(2, q) is the only (2, 3) group 
of degree q+l and order luith the exception that, in the case q = 7, the 

group ArL(l,8) also satisfies this description. 

Now, on the one hand, by Result 4 on page 841 of [A3] we know that, in case 
k is algebraically closed, Gn,q is a quasi p-group, and, on the other hand, for the 
group ArL(l,8) = GF(8) x rL(l,8) we have rL(l,8) = GL(1,8) >^ Aut (GF(8)) 
and hence Aut (GF(8)) = Z3 is a homomorphic image of ArL(l,8) and therefore 
ArL(l, 8) is not a quasi 7-groupJ^ Consequently, in case k is algebraically closed, 
the said exception cannot occur and hence if k is algebraically closed and t = 1, 
then Gn,q = PSL(2, q) = PSL(2, n - 1). 

Before proceeding further, let us record the following 

Fourth Irreducibility Lemma. Remember that p is a prime number, and t is 

a positive integer^^ For < i < t, let ,fi{Y) and gi{Y) be manic polynomials of 
degree p + t — i in Y with coefficients in a field Ki, such that gi(Y) is obtained 
from fi{Y) by making one or more of the three operations of multiplying the roots 
by a nonzero quantity in Ki, reciprocating the roots,'^'^ and decreasing the roots by 
a quantity in Ki. Assume that, for 1 < i <t, the polynomial fi{Y) is obtained by 
throwing away a root ai of gi-i{Y), and for the field Ki we have Ki = Ki-i{ai). 
Also assume that fo{Y) has no multiple roots 7^ Finally assume that there exist 
valuations u*^*) and u*^*) of Ki for <i <t such that u'"' = for 1 <i <t, the 
valuation u^'^ is an extension ofu^^~^^ to Ki] for 1 < i <t, the valuation u^*^ is an 
extension ofu^^~^^ to Ki; for every i with 1 <i <t we have f(u^'^ : u^^~^^) ^ 0(p); 
and for some j with 1 < j < t we have f{u'^^'> : u^^"^)) = 0{p). 

Let u' be an extension of u'^'^^ to a splitting field K' of gtiY) over Kt, and let 

r* = n*=i ^('^^^ ^*'"^^) ^' = ^^*^)- u^*' = . . be all 

the extensions ofu^*~^^ to Kt, and let 

„ _ LCM(f(Mf^ : . . . ,r(Mf : n^*"^))) 

Finally let tS^\ . . . ,u^^\u' be any valuations of Kq, . ■ . , Kt, K' respectively, with 
uC^) = u^^\ such that is an extension of u^^~^^ for I < i < t, and u' is an 
extension of u^^^ for 1 < i < t, and let f* = 11^=1 ^('"^'^ • and f' = f{u' : 

Then the polynomial gt{Y) is irreducible in Kt[Y], and we have \ Gal{gt, Kt)\ = 
O(r') andr' = 0(r") andf*f' = r*r' . Moreover, ifgt-\iY) is irreducible in Kt-\\Y\, 

and the residue field ofu^*^^ is an algebraically closed field of the same characteristic 
as Kq, and f{uj : u*^*"^^) ^ 0(chari4ro) for 1 < j < S, then r' = r" . 

To see this, take an extension u' of m*^*' to K'. By assumption 

f{u^^^ : u^^~^^) = 0{p) for some j with 1 < j <t 

'^^A quasi p-group can be characterized as a finite group having no homomorphic image whose 
order is prime to p and greater than 1. 

'^^In this lemma, t is allowed to be divisible by p. 

''^In case of reciprocating the roots, we of course assume that zero is not a root. 
^^In any field extension of Kq. 
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and clearly f{u' : it*^°)) = r*r', and hence we must have f(u' : u^"^) = 0(p). Now 
both the valuations u' and u' are extensions of = u^^^ to K' which is a Galois 
extension of KqJ^ and hence f{u' : u^^^) = f{u' : u^"^), and therefore f{u' : u^^^) = 
0(p). By assumption f(u(*^ : m^*"-*^^) ^ 0{p) for 1 < i <t and clearly f{u' : ii'-^^) = 
f{u^^^ : M^"^) . . . f(u(*) : u(*~-'^^)f(u' : w^*^), and hence we must have f{u' : w*^*') = 
0(p). By assumption i^' is a splitting field of S(t(^) over Kt and hence | Gal(,g(, Kt)\ 
must be divisible by f{u' : m*-'-*), and therefore | Gal{gt, Kt)\ = 0{p): since gtiY) is a 
monic polynomial of degree pinY with coefficients in Kt, we conclude that gtiY) 
is irreducible in ^^([l']. Since K' is a splitting field of gt{Y) over Kt, we also get 
I Gal(gf , Kt)\ = O(r'). Now K' is a Galois extension of Kt^i, and hence r' = 0(r"). 
Since i^' is a Galois extension of Kq, and u' and ?i' arc extensions of u^^^ = v/-^^ to 
K' , we also get f*f' = f(M' : fl'^'^^) = f(u' : u'-"-') = r*r' . If gi_i(y) is irreducible in 
ii't_i[y] then K' is a least Galois extension of Kt-i containing Kt, and hence the 
last assertion follows from Proposition 7 on page 845 of [A3]. 

Note that this lemma gives an alternative proof of the irreducibility of A(Y) in 
case of q = p and t = 1. In fact we have the following 

Corollary of the Fourth Irreducibility Lemma. If k is algebraically closed 

and F{Y) is a monic irreducible polynomial of degree p+l inY with coefficients in 
k{X) such that no valuation of k{X)/k, other than the valuation given by X = oo, 
is ramified in the root field of F{Y) over k{X), then the Galois group Gal(i^, k{X)) 
is 2-transitive. 

To deduce this from the lemma, it suffices to note that, in view of Proposition 6 
on page 835 of [A3], in the root field of F{Y) over k{X), the valuation X = oo must 
split into two valuations with reduced ramification exponents p and 1 respectively/'' 

More interestingly, in view of the ramification diagram and implication (2"), 
by the above lemma we see that if q = p and n = p + t with t = 2 ^ 0{p), 
then ^(W) is irreducible in k{y,z)[W] and so Gn,p is 2>-transitive, and the order 
of its S-point stabilizer Gal($, k{y, z)) is divisible by t, where r = or t = 
according as p =1(3) or p ^ 1(3), and hence the order of Gn,p is divisible by [p + 
2)(p+ l)pT, and moreover the reduced ramification exponent of any extension of C,q 
{resp. Cii C2, Coo.i, • • • ) Coo.t") to a splitting field of^{W) over k{y, z) is divisible by r 
{resp. pr, r, 1, . . . , 1), and, in case k is algebraically closed, the reduced ramification 
exponent of any extension of ^0 [resp. ^i, C2, Coo.i, • • • , Coo.t") to a splitting field of 
^{W) over k{y, z) equals r {resp. pr, r, 1, . . . , 1)7^ 

Therefore by the following Corollary of CTT we see that if q = p and n — p + t 
with t = 2^ 0(p), and if Gn,p is neither CQUdl to A.yi nor equal to Sn^'^ then either 
p = 1 and Gn,p = PSL(2,8), or p = 7 and G„,p = PrL(2,8), or p = 31 and 
Gn,p = PTL{2,32). 

Corollary of CTT. Let G be a 3-transitive permutation group of degree n = p + 2 
with an odd prime p, such that G is neither equal to An nor equal to 5„. Then 



Because it is a splitting field of fo{y) over Ko- 

'^^Alternatively, in view of Proposition 6 on page 835 of [A3], this Corollary can be deduced 
from the fact that if G is a transitive permutation group of degree p + 1 of order divisible by p 
then G must contain a p-cycle and hence it must be 2-transitive. 

''^In the present case, by (2') we have t* = 2. 

'^^That is if, as a permutation group, it is not isomorphic to either of these. 
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p = 2^' - I for som.e prima and either G = PSL(2,2'') or G = PrL(2,2''). 
Moreover, if the order of G is divisible by {p + 2){p+ l)pT, where r = or 
according as p = 1(3) or p ^ 1(3), then either p = 7 and G = PSL(2, 8), or p = 7 
andG = PTL{2,8), orp = 31 and G = PTL{2,32). 



To deduce the above Corollary from CTT, by inspection wc see that no group 
listed in items (3) to (8) of CTT has degree of the form p + 2 with an odd prime 
p. Moreover the degree of every group listed in items (1) and (2) of CTT is of the 
form tt'' + 1, with prime n and positive integer /i, and if we have tt^ + 1 = p + 2 
then, in case of odd tt, we would get the contradiction p = tt^ — 1 = 0(2). Therefore 
by CTT we conclude that p = 2** — 1 for some prime fi, and G is a group between 
PSL(2,2^) and PrL(2,2^). Since fi is prime and PSL(2,2'') is a normal subgroup 
of PrL(2, 2^) of index /z, there are no groups strictly between PSL(2,2'') and 
PrL(2,2^). Therefore G = PSL(2,2^) or G = PrL(2,2''). Now if the order of 
G is divisible by {p + 2){p + l)pT, then {p + 2){p + l)p^J, = |PrL(2,2^)| > |G| > 
{p + 2)(p + l)pT and hence jj. > t > ^ = {2^' - 2)/6 and therefore 2^"! - 1 < 3^^; 
on the other hand, if /z > 5 then 2''-! - 1 = (1 + 2 + 2^) + (2^ + 2* + • • • + 2^'^) > 
7 + 8(^ — 4) = 3/z + 5(/i — 5) > 3/U. Therefore if the order of G is divisible by 
(p + 2){p + l)pT, then we must have /i = 2 or 3 or 5, i.e., p = 3 or 7 or 31; now the 
case p = 3 is ruled out because then PSL(2, 2^) = A„ and PrL(2, 2'^) = Sn with 
11 = 2 and n — h\ also, in case of p = 31, wc cannot have G = PSL(2,32) because 
|PSL(2,32)| = 33-32 -31 < 33-32 • 31 • 5 = (p + 2)(p + l)pT. 

Once again by Result 4 on page 841 of [A3] we know that, in case k is algebraically 
closed, the group Gn,q is a quasi p-group, and obviously, in the case p 7^ 2, the group 
Sn is not a quasi p-group because it has Z2 as a homomorphic image, and likewise, 
in the case of a Mersenne prime p = 2^ — 1, with a prime /i, the group P F L(2, 2'') 
is not a quasi p-group because it has as a homomorphic image. Therefore by 
the above italicized conclusion we see that if k is algebraically closed and, q = p and 
n = p + t with t = 2 ^ 0{p) , then in the case p ^ 7 we have Gn,p = ^n, whereas in 
the case p = 7 we have Gn,p — An or PSL(2, n — 1). 

Now let IV be an element in an overfield of k{y, z) such that 



(**) $(w) = 0, 



and let '^{U) be the monic polynomial of degree g — 1 in an indeterminate U with 
coefficients in k{y,z,iv) obtained by throwing away the root u) of ^{U). Actually, 
$(f/) e k{z)[U] and hence *(?7) e k{z,u;)[U]. 

To compute ^{U), let prime stand for the twisted f -derivative at u; then we 
have *(f/) = $'(f/), and by the prime power rule we have (W)' = f/'~^, and by 



Recall that a prime p is called a Mersenne prime if it is of the form 2*^ — 1 for some positive 
integer n, and then automatically n is itself a prime number. 
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the power rule we have (U^)' = and (U)' = 1, and by direct calculation we have 





;-i +(1+^)^(1 +1- 
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and therefore, in view of (2*), by linearity we see that 

r ^-(?7) = + {z + l)z[U''-^ - wU"-^ + a;2[/«-^ + u^-^U] 

\ -(^ + l)^[w«-2 + (^ + 2)«^«-^]. 



t = 2=> 



For a moment suppose that fc is algebraically closed and q = p = 7 and t = 2. 

Since is 4-transitivc and PSL(2, 8) is sharply 3-transitivc, by the above italicized 
assertion it follows that Gn,p An Gn,p = PSL(2,n — 1) <^ '^{U) is reducible 
in k{y, z,oj)[U] <^ ^'(C^) completely factors (into linear factors) in k{y, z,uj)[U]. To 
convert the question of reducibility of "^{U) over the field k{y, z, lu) to its reducibility 
over the simpler field k{z,uj), first note that k{z,uj) C k{y,z,ui) and hence ^'(i7) 
is reducible in k{z,uj)[U] =^> vE'(C/) is reducible in k{y, z,uj)[U]; next, by (2') we see 
that k{y, z, oj) is a cyclic extension of k{z, lu) of degree f or 3 or 9, and hence ^'(C/) 
completely factors in k{y, z, io)[U] the degrees of the irreducible factors of \E'(J7) in 
k{z,ui)[U] are (3,3) or (3,1,1,1) or (1,1,1,1,1,1). Therefore ifk is algebraically closed 
and q = p = 7 and n = p + t with t — 2 then Gn,p ^ j4„ <^ Gn.p = PSL(2, n — 1) <^ 
^'(C/) is reducible in k{y, z,u>)[U] <J4> 4'(f7) completely factors {into linear factors) 
in k{y, z,uj)[U] '^{U) is reducible in fc(z,w)[J7] the degrees of the irreducible 
factors of ^{U) mk{z,uj)[U] are (3, 3) or (3, 1,1,1) or (1, 1, 1, 1, 1, 1). 

To further simplify the question of reducibility of ^'(J7), we proceed to eliminate 
the last two possibilities of (3,1,1,1) and (1,1,1,1,1,1) for the degrees of the 
irreducible factors of '^{U) in k{z,uj). Afterwards, we shall also indicate how the 
question of reducibility of ^'(J7) in k{z, ui)[U] can be converted to its reducibility in 
kp{z,ui)[U]. 

Recall that 7(^) is a monic polynomial of degree t — 1 in Z with coefficients in 
the prime field kp C k. Let 



and 



A*{Z) = {Z + t)'^j{Z) - y*Z' e k,{y*)[Z]. 



Now the element y* is transcendental over k, the polynomial A*{Z) is irreducible 
in k{y*)[Z], the element ^; is a root of A*{Z), the polynomial A'{Z) G kp{z)[Z] is 
obtained by throwing away the root z of A*{Z), the polynomial ^{W) G fcj,(2:)[W] 
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is obtained by multiplying the roots of A'{W) by z, the element a; is a root of 
^{W), the polynomial 5'(C/) € kp{z,co)[U] is obtained by throwing away the root w 
of ^{U), and finally if q = p and t = 2 then the polynomial is irreducible in 

k{z)\W\ and hence G&\{h* ,k{y*)) and Gal( A*, fcp(y*)) are 2-transitive permutation 
groups of degree p+l. 

For t = 2 we have "i{Z) = Z +1 and so 



(2***) t = 2 



y 



and A*{Z) = {Z + 2f{Z + 1) - y^Z"^. 



If q = p and t = 2 then either directly by the above expression for y* , or 
indirectly by the formulae (*) and (2') and the ramification diagram of the field 
extension k{y) C k{y,z), we see that Wq : y* = and : y* = oo are the only 
valuations of k{y*)/k which arc ramified in k(z), the valuation Wq splits into the 
valuations Ki : z + 1 = and K2 : z + 2 = of k{z)/k with reduced ramification 
exponents f{K\ : Wq) = 1 and f{K2 ■ Wq) = p, and the valuation splits into 
the valuations kq : z = and of k(z)/k with reduced ramification 

exponents f{Ko : w^) = 2 and f(K;oo ^ w^^) = p — 1. Thus, in case of q = p and 
f = 2 we have the special ramification diagram for the field extension k{y*) C k{z) 
where the square bracketed numbers are the reduced ramification exponents: 



■■ y 



00- 



Ki:z + l = 0[1] 



K2 : ^ + 2 = 0[p] 
Ko:z = 0[2] 



: z = oo[p — 1]. 



For a moment suppose that q = p and t = 2. Since the degree of is p, by 

the above diagram we see that ki has a unique extension Ai to k{z,Lu), for Ai we 
have f(Ai : Ki) = p, and the reduced ramification exponent of any extension of Ai 
to a splitting field of '^(U) over k{z,uj) equals the reduced ramification exponent 
of any extension of K2 to the said splitting field. By direct calculation with the 
expression of ^{W) given in (2*), it can be shown that K2 has 3 extensions to 
k{z,Lu) with reduced ramification exponents Therefore, say by the 

Fourth Irreducibility Lemma, the reduced ramification exponent of any extension 
of Ai to the said splitting field must be divisible by . Consequently the degree 
of some irreducible factor of ^{U) in k{z,u))[U] must be > The said direct 

calculation will not be given here,^^ and hence, for deducing the last consequence, 
at least in the case when k is algebraically closed, let us indirectly argue in the 
following manner. 

So for a moment suppose that k is algebraically closed and q = p and t = 2. 
Now in view of MRT,*^ by implication (2') and by the paragraphs "Finally note 



*^But it will be given in my forthcoming paper [A7]. 
*2mRT = Abhyankar's Lemma = Pages 181-186 of [A5]. 
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that..." and "More interestingly...", we see that ki,K2,ko arc the only valuations 
of k{z)/k which are ramified in a splitting field of (f>(M^) over k{z), and the reduced 
ramification exponent of any extension of ki (resp. K2, ko) to the said splitting field 
is £^^Ili (resp. ^,^'). Since the degree of ^{W) is p, it follows that ki has 
a imiquc extension Ai to k{z.^'), for Ai wc have f(Ai : ki) = p, and the reduced 
ramification index of any extension of Ai to a splitting field of "^{U) over k{z, uj) is 
Therefore the degree of some irreducible factor of '^(U) in k{z,u!)[U] must be 

> 

— 2 • 

Finally for a moment suppose that k is algebraically closed and q = p = 7 
and t = 2. Then by the above paragraph and the paragraph following (2**) we 
see that '^'{U) is reducible in k{z,uj)[U] <^ the degrees of the irreducible fac- 
tors of *([/) in k{z,Lu)[U] are (3,3) or (3,1,1,1). Thus, *([/) is reducible in 
k{z,u))[U] Gal(A*, fc(y*)) is 2-transitive but neither 3-transitive nor sharply 2- 
transitive. Therefore, since AGL(1,8) is sharply 2-transitive, by the Special CDT 
we conclude that *([/) is reducible in k{z,Lu)[U] O Gal(A*, A;(y*)) = PSL(2, 7) or 
A r L(l, 8); by the Zassenhaus-Feit-Suzuki Theorem, both of these are (2,3) groups, 
and hence '^{U) is reducible in k{z,uj)[U] ^ the degrees of the irreducible factors 
of '^{U) in k{z,ui)[U] are (3,3) and they have a common splitting field over k{z,Lo) 
with Galois group Z^. It follows that ^'(J7) is reducible in kp{z,co)[U] <^ the de- 
grees of the irreducible factors of *(t/) in kp{z,LL!)[U] are (3,3) <^ Gal(A*, fcp(y*)) 
is 2-transitive but neither 3-transitive nor sharply 2-transitive O Gal(A*, fcp(y*)) 
is not 3-transitive; therefore by CTT, Special GDT, and the Zassenhaus-Feit- 
Suzuki Theorem, we see that "${11) is reducible in kp{z,uj)[U] <^Gal{A* , kp{y*)) = 
PSL(2,7) or ArL(l,8) Gal(A*, A;p(y*)) ^ {^g, A8,AGL(3, 2), PGL(2, 7)}. 

Let us close this long section with a 

Summary about the bar polynomial. Let k be an algebraically closed field 
of characteristic p ^ 0. Let q he a. positive power of p and let n = q + t where 

t is a positive integer with t ^ 0{p). Let F„,, = y« _ XF* + 1 G k[X,Y] and 
let Gn,q = Gal{Fn,q,k{X)). Without using CT we have shown that if t = 1 
then Gn,q = PSL(2,(j') = PSL(2,n — 1), whereas if q = p and t > 2 ^ p then 
Gn,q = An, and finally if q=p and t> 2 = p then Gn,q = Sn-^^ Using CT we have 
shown that if q = p ^ 7 and t = 2 then Gn,q = ^n- Using CT, and referring to 
(2*), (2'), (**), (2**), (2***) for notation and remembering that z is transcendental 
over fc, we have also shown that if q — p = 7 and t — 2 then Gn,q 7^ An <S=> Gn,q = 
PSL(2,n- 1) <^ ^{U) is reducible in fc(y, z,ijj)\U] <^ ^{U) completely factors (into 
linear factors) ink{y, z,ijj)\U] '^{U) is reducible in A;(2:, u;)[J7] <^ the degrees of the 
irreducible factors of ^{U) in k{z, w) \U] are (3, 3) and they have a common splitting 
field over k{z,uj) with Galois group Z3 ^ Gal{A* , k{y*)) = ArL(l,8) ^ *(t/) 
is reducible in kp{z,uj)[U] 4^ the degrees of the irreducible factors of ^{U) in 
kp{z,Lo)[U] are (3,3) ^ Gal(A*, fcp (?/*)) = ArL(l,8). 

Remark. To establish the above long chain of equivalences, we still need to prove 
that 

(') G9,7 = PSL(2, 8) ^ Gal(A*, fc(y*)) / PSL(2, 7) 



Since PSL(2, 2) = S3, it follows that if g = p = 2 then G„,q = S, 
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and 

(") Gal(A*,fc(y*)) 

= ArL(l, 8) => Gal(A*, kp{y*)) ^ {Ss, As, AGL(3, 2), 

PGL(2,7),PSL(2,7)}. 

Now Gal(A, A:(y)) and AGL(1,8) are the 1-point stabilizers of Ggj and PSL(2,8) 
respectively, and hence to prove (') it suffices to show that 

("') Gal(A, k{y)) = AGL(1, 8) ^ Gal(A*, k{y*)) ^ PSL(2, 7) 

Let / be a nonconstant univariate monic polynomial with coefficients in a field K 
having no multiple roots in any field extension of K, and let K* be a (finite) Galois 
extension of K. By (2') and (2***) wc sec that, in the case q = p = 7 and t = 2, the 
field k{y) is a Galois extension of the field k{y*) and hence, by taking / = A = A* 
and {K,K*) = {k{y*) , k{y)) , implication ("') follows from the implication 

(tt) Gal(/, K*) = AGL(1, 8) ^ Gal(/, K) ^ PSL(2, 7). 

Also clearly for some finite algebraic field extension k* of kp wc have that Gal(A*, k*{y*)) = 
Gal(A*, fc(y*)) and k*{y*) is a Galois extension of kp{y*) and hence, by taking 
/ = A* and {K,K*) = {kp{y*),k*{y*)), implication (") follows from the implica- 
tion 

(tttt) Gal{f,K*)=ATL{l,8) 

^ Gal(/, K) ^ {Ss, As, AGL(3, 2), PGL(2, 7), PSL(2, 7)}. 

Now AGL(1, 8) is a nonidentity solvable group and PSL(2, 7) is a nonabelian sim- 
ple group, and hence AGL(1, 8) is not isomorphic to a normal subgroup of PSL(2, 7), 
and therefore (jj) follows from the Refined Extension Principle. Likewise, ArL(l, 8) 
is a nonidentity solvable group but all the nonidentity normal subgroups of {Ss, 
AGL(3, 2), PGL(2, 7), PSL(2, 7)} arc nonsolvable because they respectively contain 
the nonabelian simple group {Ag, Ag, PSL(3, 2), PSL(2, 7), PSL(2, 7)} as a normal 
subgroup, and hence A F L(l, 8) cannot be isomorphic to a normal subgroup of any 
one of the groups {S'g, Ag, AGL(3, 2), PGL(2, 7), PSL(2, 7)}, and therefore (Sll) also 
follows from the Refined Extension Principle. 

Note. Assuming q = p = 7 and t = 2, in my forthcoming paper [A7], first by 
resolving the singularities of the curve $(a;) = in the (^;, a;)-plane I calculate its 
genus in terms of infinitely near singularities, and then, seeing that the genus is 2 
and hence the curve is hyperelliptic, by means of adjoints I express it as a double 
covering of the line, and finally, using the resulting square-root parametrization of 
the curve I show that the polynomial ^{U) factors in k{z,uj)[U] into two factors of 
degree 3, and so I conclude that Ggj = PSL(2,8). Initially I used the square-root 
parametrization to take the "norm" of '^{U) which is a monic polynomial ^'"(7, U) 
of degree 12 in ?7 with coefficients which arc polynomials in T over the prime field 
kr. I used REDUCE and MACSYMA to calculate *«(r, U); the largest coefficient 
degree in T turned out to be 216, and it took 10 pages to print out the exact 
expressions of the coefficients. Prom what I have said above, it follows that ^'(J7) 
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is reducible in k(z,oj)[U] ^ ■^HTM) is reducible in kr[TM] ^ *f(r, C/) factors 
in kr[T,U] into two polynomials of degree 3 in [/. Although the last two tasks 
are finitistic in nature, the computer algebra packages REDUCE and MACSYMA 
refused to factor bivariate polynomials over a finite field! So I turned back to hand 
calculation in a hyperelliptic function field, using MACSYMA only for checking 
ordinary polynomial operations !! 

22. The roof polynomial and decreasing induction 

Let X and Y be indeterminates over an algebraically closed field k of charac- 
teristic p ^ 0, let ^ a G k, let s and t he positive integers with t ^ 0(p), and 
consider the polynomial 

F„ = y" - aX'Y* + 1 with n=p + t 

mentioned in §11. This corresponds to the q = p case of the more general polynomial 

Fn,g = F" - aX'Y* + 1 

with n = q + t and q = a positive power of p which was also mentioned in §11. The 
polynomials F„ and in turn may be obtained by substituting aX^ for X in the 
polynomials 

Fn,g = y" - XF* + 1 

with n = q + t and q = a, positive power of p and 

F„ = y" - Xy' + 1 with n=p + t. 

As noted in the beginning of §21 , the y-discriminant of the polynomials _F„ and Fn^q 
is the nonzero element n" of k, and hence the y-discriminant of the polynomials 
Fn and is also the nonzero element n" of k, and therefore each one of the four 
polynomials Fn, Fn q, Fn, Fn q gives an unramified covering of the afRne line over 

In the beginning of §21, we noted that the polynomials F„ and Fn,q are irre- 
ducible in k{X)\Y], and considering the Galois groups G„ = Gal(F„,fc(X)) and 
Gn,q = Gal(f „,g, fc(X)), as summarized in the Summary and the Note at the end 
of that section, in the rest of that section we proved the following.^^ 

(I.l*) If i = 1, then Gn = PSL(2,p) = PSL(2, n - 1). 

(1.2*) If t = 2 and p = 7, then G„ = PSL(2, 8) = PSL(2, n - 1). 

(1.3*) If * = 2 and p 7^ 7, then G„ = 

(1.4*) If i > 2 and p / 2, then G„ = A„. 

(1.5*) If p = 2, then G^^ 5„. 

(III.l*) If t = 1, then Gn.q = PSL(2, q) = PSL(2, n - 1). 

*'*That is, say in view of Proposition 1 of [Al], X = oo is the only valuation of k{X)/k which 
is possibly ramified in the splitting field of F„ (resp. Fn,g, Fn, Fn^q) over K{X). 

*^As said earlier, my work on this paper started when, in September 1988, Serre told me that 
he could prove (I.l*). As he now tells me, his proof, which also applies to (IILl*), uses "descending 
Galois theory" which is different from my method which he calls the "ascending" method. With 
Serre's kind permission, his letter to me, dated 15 November 1990, describing his "descending" 
proof, is appended herewith. 
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As noted in the said Summary and Note, on the one hand, CT was not used in 
the proofs of (I.l*), (1.4*), (1.5*), and (III.l*), and on the other hand, the proof 
(1.2*) was complete only modulo the reducibility of the polynomial ^{U) which 
will be established in [A7]. Actually, CT was used only in the sense that, assuming 
t = 2, we first showed G„ to be a 3-transitive permutation group of degree n and 
from this by CTT we deduced that G„ = or Sn- Now the 3-transitivity tells 
us that \Gn\ > n(n — l)(n — 2) and hence if p = 3 then obviously Gn = An or 
Sn- Likewise, for p = 5 we need not invoke CT because again in that case, say in 
view of the following elementary theorem which occurs as item IV on page 148 of 
Carmichael's book [Ca],^^ the 3-transitivity directly tells us that G„ = An or 5„. 

Carmichael's Theorem. A„ and Sn are the only l-transitive groups of degree n 
for which Z > [-^ + 1] where + 1] is the greatest integer not exceeding ^ + 1- 

By Corollaries (3.5) to (3.10) of the Substitutional Principle, the Galois groups 
Gn,q = Ga.l{Fn,q,k{X)) and Gn = Gal{Fn,k{X)) have the same description as 
the above description of the Galois groups G„ = Gal(F„, fc(X)) and Gn,q = 
Gal{Fn,q,k{X)). Let us summarize this in the following 

Summary about the roof polynomial. Let k be an algebraically closed field 
of characteristic p ^ 0, let 7^ a S fc, and let s and t be positive integers with 
t ^ 0(p). Then the polynomials F„ = F" - aX^Y* + 1 with n=p + t and Fn,q = 
— aX^Y* + 1 with n = q + t and q = & positive power of p, give unramified 
coverings of the afhne line over k, and for their Galois groups G„ = Gal(i^„, A;(X)) 
and Gn,q = Gal(-F„,g, k{X)) we have the following. 
(LI) lit = l, then G„ = PSL(2,p) = PSL(2,n- 1). 

(1.2) If t = 2 and p = 7, then G„ = PSL(2, 8) = PSL(2, n - 1). 

(1.3) If t = 2 and p 7^ 7, then G„ = A„. 

(1.4) If t > 2 and p 7^ 2, then G„ = A„. 

(1.5) If p = 2, then Gn = S„. 

(III.l) If i = 1, then Gn,q = PSL(2, q) = PSL(2, n - 1). 

Here, CT is not used in the proofs of (I.l), (L4), (1.5) and (III.l); likewise, it 
is not used in the proof of (1.3) for p < 7. Moreover, referring to the polynomial 
^(U) obtained by taking q = p = 7 in item (2**) of the §21, the proof of (1.2) is 
complete only modulo the reducibility of "^{U) in k{z,co) to be established in [A7]. 

Now what does the "decreasing induction" in the title of this section refer to? 
Roughly speaking, it says that if we can find an unramified covering of the afBne 
line with Galois group An then we can find one with Galois group A^—i- If this 
were so without any qualification, then an An covering for large n would yield an 
An covering for all smaller n. But there is a qualification! More precisely, we have 
the following 

Method of decreasing induction. Assuming that we are working over an al- 
gebraically closed field k of characteristic p 7^ 0, let there be given an irreducible 



*®The statement of Charmichael's Theorem given on page 154 of Volume I of Huppert [HB] 
says that A„ and S„ are the only i-transitive groups of degree n for which I > ^. This seems 
incorrect bcacausc for (I, n) = (4, 11) we have 4 > 4^ but the Mathieu group Mn is a 4-transitive 
group of degree 11, and for (l,n) = (5,12) we have 5 > ^ but the Mathieu group M12 is a 
5-transitive group of degree 12. 
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n-fold unramifiod covering of the affinc line by the affine line,**^ such that the point 
at infinity sphts exactly into two points, say the origin and the point at infinity, 
with ramification exponents p and t = n — p^Q{p)^ and let G be the Galois group 
of the given covering.** Now if G = A„ with n > 5 then we can find an unramified 
covering of the affine line with Galois group An-i- More generally, without any 
condition on n or G, we can find an unramified covering of the affine line with 
Galois group G* <i Gi = the 1-point stabilizer of G such that Gi /G* is cyclic of 
order nondivisible by p. 

To prove this informally, say the original line is the X-axis, and the covering 
line is the F-axis, and let C be the least Galois covering of the X-axis "containing" 
(or, "dominating") the F-axis. Then the Galois group of G over the F-axis is the 
one-point stabilizer G\ of G. Moreover, the origin of the F-axis is tamely ramified 
in G, say with reduced ramification exponent t* and the point at infinity of the 
F-axis is the only other point which is possibly ramified in G. Now considering 
the F*-axis with a*Y*'^ = Y with ^ a* € k and r = 0{t*), and passing to the 
"compositum" G* of G and the F*-axis, by MRT^° we see that G* is an unramified 
covering of the affine line = the F*-axis, and upon letting G* to be the Galois group 
of G* over the F*-axis, by Corollary (3.1) of the Substitutional Principle we see 
that G* <Gi = the 1-point stabilizer of G, and Gi/G* is cyclic of order nondivisible 
by p. _ 

To apply this method to the polynomial F„, from the initial material^^ of §21 
we recall that^^ by putting X = x and reciprocating the roots of F„ we get the 
polynomial 

e(F) ^YP+* + xYP + l 

and by letting y be a root of 0(F) in an overfield of k{x) we have k{x,y) = k{y), 
and by letting ©'(F) be the twisted F-derivative of ©(F) at y, and letting A(F) 
be the polynomial obtained by reciprocating the roots of 8'(F), and letting A.{Z) 
be the polynomial obtained by multiplying the roots of A{Z) by ty, we get 

A{Z) = ^{Z){Z + tf - tP-'^y-P-*Z\ 

where 

^{z) = t-\z + tY -z% 

Now A(Z) is irreducible in k{y)[Z] and has no multiple roots in any overfield of 

k{y), and upon taking 1-point stabilizers, by (1.3*), (1.4*), and (1.5*) we get the 
following where CT is used only in the p > 5 case of (1.3**). 

(1.3**) If i = 2 and p 7^ 7, then Gal(A(Z), k{y)) = A„_i. 

(1.4**) If t > 2 and p ^ 2, then Gal(A(Z), k{y)) = An-i. 

(1.5**) If p = 2, then Gal(A(Z), k{y)) = Sn-i- 



^'^That is, a covering of the projective line by the projective line, which is unramified over the 
aifine line. 

**That is, G is the Galois group of the associated least Galois covering. 

*^By the argument on pages 843-845 of [A3] we see that t* divides LCM(t, (p— 1)!) and hence 
the origin of the Y-axis is tamely ramified in C 
= pages 181-186 of [A5]. 
^^Up to formula (*). 

^^Until further notice q = p and n = p + t with positive integer t ^ 0(p). 
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Remembering that s is any positive integer and a is any nonzero element of k, 
we let 

AsAZ) = -f{Z)iZ + t)P - aY-'Z* 

and we note that if s = 0{p+t) then As_a(Z) can be obtained from A{Z) by replacing 
y by a'y/(P+*), where a' e fc is such that tP-^arP"^ = a. In view of the LCM 
Theorem, ^'^ by the first ramification diagram''"' in §21, we see that LCM(p— l.t) is 
divisible by the reduced ramification exponent of every extension of the valuation 
y = of k{y)/k to a splitting field of A{Z) over k{y), and no valuation of k{y)/k, 
other than the valuations y = and ?; = oo, is ramified in the said splitting field. 
Therefore, if s = 0((p + 1) LCM(p — 1, t)) then by MRT we see that no valuation of 
k{Y)/k, other than the valuation Y = oo, is ramified in a splitting field of A^^aiZ) 
over k{Y), and obviously As^a{Z) has no multiple roots in any overfield of k{Y), and 
in view of Corollaries (3.3), (3.5), and (3.8) of the Substitution Principle, by (1.3**), 
(1.4**), and (1.5**) we see that (3') if t = 2 and p ^ 7, then Gal(A,,a(^), k{Y)) = 
An-i, whereas (4') if t > 2 and p^2, then Gal(As.a(Z), k{Yj) = A„_i, and finally 
(5') if p = 2, then Gal(As.a{Z), k{Y)) — Sn-i- Thus we have the following where 
CT is used only in the p > 5 case of (1.3'). 

(1.0') If s = 0((p + t)LCM(p- l,t)) then no valuation of k{Y)/k, other than the 
valuation Y = oo, is ramified in a splitting field of Ag^aiZ) over k{Y), and As_a{Z) 
has no multiple roots in any overfield of k{Y). 

(1.3') If i = 2 andp ^ 7 and s = 0{{p+t) LCM(p-l, t)), thenGal(As,a(Z), k{Y)) = 

An-l. 

(1.4') Ifi > 2and39^ 2ands = 0((39+i) LCM(p-l, i)), thenGal(As,a(Z), fc(y)) = 

An-l- 

(1.5') Ifp = 2 and s = 0((p + 1) LCM(p - 1, i)), then Gal{As,aiZ), k{Y)) = Sn-i- 
Actually, the above four assertions remain valid if we replace the assumption that 
s = 0{{p + t) LCM(p - 1, i)) by the weaker assumption that s = 0(LCM(p - 1, t)). 
To see this, first note that A{Z) can be obtained from Ai_i(Z) by replacing Y by 
t^~PyP^*\ now since no valuation of k{y)/k, other than the valuations y = and 
y = oo, is ramified in a splitting field of A{Z) over k{y), it follows that no valuation 
of k{Y)/k, other than the valuations Y = and Y — oo, is ramified in a splitting 
field of Ai,i(Z) over k{Y); moreover since A{Z) is irreducible in k{y)[Z] and has 
no multiple roots in any overfield of k{y), it follows that Ai.i(Z) is irreducible 
in k{Y)[Z] and has no multiple roots in any overfield of k{Y); finally, in view of 
Corollary (3.1) the Substitutional Principle, by (1.3**) to (1.5**) we see that (3***) 
if t = 2 and p / 7, then Gal(Ai,i(Z), A;(V)) = An-i or Sn-u and (4***) if t > 2 
and p ^ 2, then again Gal(Ai,i(Z), A;(y)) = A^-i or Sn-i, and (5***) if p = 2, 
then Gal{Ai^i{Z),k{Y)) = Sn-i- Again, As,a{Z) can be obtained from Ai^i(Z) 
by substituting aY^ for Y and hence, in view of Corollaries (3.2) to (3.8) of the 
Substitutional Principle, we get the following where CT is used only in the p > 5 
case of (1.3"). 

(I.O") No valuation of k{Y)/k, other than the valuations Y — and Y = oo, 
is ramified in a splitting field of As_o(Z) over k{Y), and Ag^aiZ) is irreducible^^ in 
fc(F)[Z] and has no multiple roots in any overfield of k{Y). 

^^That is, Proposition 7 on page 845 of [A3]. 

®'*Together with its explanation in the three paragraphs following it. 

^^The irreducibility of As,a follows from the First Irreducibility Lemma. Alternatively, it 
follows from the fact that a polynomial is irreducible if and only if its Galois group is transitive. 
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(1.3") If i = 2 and p 7^ 7, then Gal(A^,a(^), k{Y)) = A„_i or 
(1.4") If t > 2 and p ^ 2, then Gal(A^,a(^), k{y)) = ^n-i or Sn-i- 
(1.5") If p = 2, then Gal(A,,a(^),fc(r)) = Sn-i- 

Now 7(0) ^ ^ t and, upon letting Z' to be a root of Ai i(Z) in an overfield of 
k{Y), we have 



and hence the valuation y = of k{Y)/k splits in k{Y,Z') = k{Z') into the 
valuations Z' = and Z' = 00 with reduced ramification exponents t ^ 0{p) 
and p — 1 ^ 0(p) respectively, and therefore by the LCM Theorem we see that 
LCM(p — 1, i) is divisible by the reduced ramification exponent of every extension 
of the valuation F = of k(Y)/k to a splitting field of A(Z) over k(Y), and hence 
by MRT we see that if s = 0(LCM(p — l,t)) then no valuation of k{Y)/k, other 
than the valuation F = 00, is ramified in a splitting field of As^a{Z) over k{Y), and 
therefore by Result 4 on page 841 of [A3] we know that Ga\( As. a{Z), k{Y)) is a 
quasi p-group. Thus, in view of (I-O"), (1-3"), (1-4"), and (1-5"), we conclude with 
the following where CT is used only in the p> 5 case of (IV. 2'). 

(IV.O') If s = 0(LCM(p - l,t)), then no valuation of k{Y)/k, other than the 
valuation Y = 00, is ramified in a splitting field of As^a{Z) over k{Y), and Ag^a{Z) 
is irreducible in A;(y)[Z] and has no multiple roots in any overfield of k{Y). 

(IV.l') If t > 2 and p 7^ 2 and s = 0(LCM(p - l,t)), thenGal{As,a{Z),k{Y)) = 

An-l. 

(IV.2') If t = 2 and p 7^ 7 and s = 0(LCM(p - l,t)), thenGal(As,a(^), k{Y)) = 

{IV A') If p = 2 and s = 0(LCM(p - 1, t)), then Gal(A,,a(Z), fc(F)) = Sn-i. 

As we have said, CT is used only when t = 2; now in the case t = 2 we have 
jiZ) = Z + 1 and hence As,aiZ) = (Z + 1)(Z + 2)p - aY'^Z^; therefore by (I.O") 
we see that if i = 2 then As^a{Z) ^ {Z+l){Z + 2)p - oY-^'Z^ is unramified outside 
y = and Y = QG. For getting hold of a variation of (1.3"') without CT, let us 
reprove the said unramifiedness in a more general context. So let 



with 1 <T <p and 0^ b € k. Now for the (ordinary) Z-derivative we have 



Y = 



j{Z'){Z'+t)P 



E{Z) = {Z+1){Z + bf - YZ'' 



Ez{Z) = {Z + bY-rYZ' 



■T-1 



and hence 



E{Z) = {Z + l)Ez{Z) + E*{Z) 



where 




•T-1 
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and therefore for the Z-discriminant we have 

Discz{E{Z)) = Resz{E{Z),Ez{Z)) 
= Resz{E*{Z),Ez{Z)) 

= {T-lfYPEzW-^Ez 

= (r - i)f 

= (r - fof (^-1) (-tY) (^^\ if 6 = ^— 

\T — 1 J T — 1 

= (-r)^(T - if 6 = ^— 

r — 1 

and hence if 6 = ^^r^j then no valuation of k{Y)/k, other than the valuations Y = 
and y = 00, is ramified in a splitting field of E{Z) over k{Y), and has no 

repeated roots in any overfield of k(Y). Therefore, upon remembering that a is any 
nonzero element of k and s and t are any positive integers with t ^0{p), and upon 
letting 

Es,a{Z) = {Z + 1){Z + b)P - aY-'Z* 

we get the following. 

(IV.O**) If 1 < t < p and b = then no valuation of k{Y)/k, other than the 
valuations Y = and y = oo, is ramified in a splitting field of Es,a{Z) over k{Y), 
and Es,a{Z) is irreducible^'' in A;(y)[Z] and has no multiple roots in any overfield 
of k{Y). 

Before proceeding further, let us make note of the following 

Alternate Corollary of the Fourth Irreducibility Lemma. // F{Z) is a 
monic irreducible polynomial of degree p + 1 in Z with coefficients in k{Y) such 
that some valuation of k{Y)/Y , say the valuation Y = oo, splits in a root field of 
F{Z) over k(Y) into two valuations with reduced ramification exponents p and 1, 
then the Galois group Gal(-F(Z), A;(y)) is 2-transitive. 

For a moment suppose that 1 < t < p and b = . Now upon letting Z* to be 
a root of Ei^i{Z) in some overfield of k{Y), we have 



{Z* + 1){Z* + b)P 

and hence the valuation y = oo of k{Y)/k splits in k{Y,Z*) = k{Z*) into two 
valuations with reduced ramification exponents p and 1 and therefore, by the above 
Alternate Corollary, Gal{Ei^i{Z), k{Yj) is 2-transitive. By the above equation for 
y we also see that the valuation y = of k{Y)/k splits in k{Z*) into the valuations 
Z* = and Z* = oo with reduced ramification exponents t ^ 0(p) and p + I — t ^ 
0{p) respectively. Therefore on the one hand, by the LCM Theorem and MRT we 
see that if s = 0{t{p +l — t)) then no valuation of k{Y) /k, other than the valuation 





^^For a while there will be no reference to n. 

^^The irreducibility of E^^^^i, follows from the First Irreducibility Lemma. 
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F = 00, is ramified in a splitting field of Es^a{Z) over k{Y), and hence by Result 
4 on page 841 of [A3] we see that Gal(i?s^a(Z), kiY)) is a quasi p-group, and so if 
we already knew that Gdl{Es,a{Z)Tk{Y)) — Ap+i or S'p+i then we would conclude 
that Gal(ii^s^a(Z), ~ Ap+i. On the other hand, by the Cycle Lemma we see 

that if GCD(p+ l — t,t) = \ then Ga\{Ei^i{Z), k{Y)) contains a i-cycle, and hence 
by Marggraff 's First Theorem we see that if also t < ^±1 then Gal{Ei^i{Z), k{Y) = 
Ap^i or Note that if p > 5 and t is an odd prime factor of then 

either t is an odd prime factor of or t is an odd prime factor of and in 
both the cases 1 < t < and GCD(p + l,t) = 1. Also note that, in the case 
p > 5, there do exist odd prime factors of IfinlKEzli^ because otherwise we would 
have p — 1 = 2™ with m > 2 and p — 3 = 2™ with m' > 1, and this would give 
2m'-i ^ Ez^ ^ Enizil ^ 2™-i - 1 which woidd be a contradiction since 2"'-! is 
even and 2™-i - 1 is odd. Thus, without using CT, we have proved the following. 

(IV.O*) If j3 > 5 then t can be chosen so that 1< t < ^±1 andGCD(p+ = 1, 
and for any such t, upon assuming b = and s = Q{t{p + 1 — t)), we have that 
no valuation of k{Y)/k, other than the valuation Y = oo, is ramified in a splitting 
field Es.a{Z) over kiY), and Es.a{Z) is irreducible in k{Y)[Z] and has no multiple 
roots in any ovcrficld of k{Y). 

(IV.3*) If p > 5 then t can be chosen so that l<t < ^ andGCD(p+ 1, = 1, 
and for any such t, upon assuming b = and s = Q{t{p + I — t)), we have 
Gal{E,,a{Z),k{Y))=Ap+i. 

The above two results with (Y, Z) changed to (AT, Y) , together with the results 
(IV.O'), (IV.l'), (IV.2'), and (IV.4') with {n,Y,Z) changed to(n + 1, A, y),^^ may 
be summarized in the following 

Summary about the primed roof polynomial. Let k be an algebraically closed 
field of characteristic p 7^ 0, let a, 6 be nonzero elements in k, let n,s,t be positive 
integers with n + 1 ^ 0(p) and n> t ^0{p), and consider the monic polynomial of 
degree n in y with coefficients in k{X) given by 

= h{Y) {Y + bf -aX-'Y^ with 0^b€k, 

where h{Y) is the monic polynomial of degree n—p in Y with coefficients in k given 

by 

_ (y + n + - y"+i^p 

Then in the following cases is irreducible in k{Y)[Z], has no multiple roots in 
any overfield of k{Y), and gives an unramified covering of Lfe with the indicated 
Galois group G'„ = Ga.l{F^, fc(A)). 

(IV.l) If n + 1 - p = t > 2 ^ p and 6 = t and s = 0(p - 1) and s = 0{t), then 

G'n = An. 

(IV.2) If n + 1 -p = t = 2 and p 7^ 7 and & = t and s = 0(p - 1), then G'„ = A„. 
(IV.3) If n = p + 1 and p > 5, then t can be chosen so that 1 < t < and 
GCD(p+l, t) = 1, and for any such t, upon assuming b = and s = 0{t{p+l—t)), 

we have G' = An- 



This was inspired by discussions with Walter Feit. 
99 Note that with these changes, ifn+l-p = t^ 0(p) then 7(y) = (Y+n+ir+^^^-Y"-+'^-P _ 
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(IV.4) lin+l-p = t and p = 2 and b = t and s = 0{t), then G'„ = 5„. 
Here CT is used only in the p> 5 case of (IV. 2). 

Referring to the summaries about the tilde polynomial and the roof polynomial 
and the primed roof polynomial, we have established the following four corollaries 
and we have arranged a proof of the First and the Second Corollaries independent 
of CT. 

First Corollary. For any n > p > 2, there exists an unramified covering of the 
affine line in characteristic p whose Galois group is An. 

Second CoroUciry. For any n>p = 2, there exists an unramified covering of the 
affine line in characteristic p whose Galois group is 5„ . 

Third Corollary. Unramified coverings of the affine line in characteristic p with 

a few more Galois groups have been constructed. 

Fourth Corollary. Let G be a quasi p-group. Assume that G has a subgroup H 
of index p + 1 such that H does not contain any nonidentity normal subgroup of 
G.^*"^ Also assume that p is not a Mersenne prime and p is different from 11 and 
23. Then there exists an unramified covering of the affine line in characteristic p 
having G as Galois group. 

It only remains to note that, in view of the said three Summaries, the above 
Forth Corollary follows from CTT, Special CDT, and the Corollary of the Fourth 
Irreducibility Lemma given in §21. Moreover, given any n > p > 2, as a definite 
alternative for getting an (>l„)-covering as asserted in the First Corollary without 
CT: if p+2 < n ^ 0{p) then use (1.4) with t = n—p; if n = p+2 andp < 7 then use 
(1.3) with t = 2;ifn = p + 2 and p > 7 then use (IV. 1) with t = 3 ands = t{p — 1); 
if n = p + 1 and p < 7 then use (IV. 2) with t = 2 and s = p — 1; ifn=p+l 
and p > 7 then use (IV.3) with t = the smallest odd prime factor of and 
with s = t{p + 1 — t), [note that in case of n = 8 and p = 7 this gives t = 3 and 
s = 15]; if n = 0(p) then use (II. 1) with t = 2. Likewise, given any n > p = 2, as a 
definite alternative for getting an (S'„)-covering as asserted in the Second Corollary 
without CT: if n ^ 0(p) then use (1.5) with t = n — p; if n = 0(p) then use (IV.4) 
with t = n + 1 — p and s = t. 

The above cited three summaries are transcribed from my e-mail message to 
Serrc dated 28 August 1989. This was only one out of the nearly a himdrcd c-mail 
and s-mail messages which flashed back and forth between him and me in the two 
year period September 1988 to September 1990. Indeed it has been a tremendous 
pleasure working with him. So once again my hearty MERCI MON AMI to Serre. 
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Appendix by J. -P. Serre* 
Harvard, November 15, 1990 

Dear Abhyankar, 

Here is my original proof that PSL2(Fg) occurs for the equation Y'^~^^ —XY-\-l = 

0. 

I use "descending Galois theory," i.e., I construct a priori the Galois covering 
one wants. This is different from your "ascending" method; in particular, I don't 
need any characterization of PGL2(Fg), or PSL2(Fg), as a permutation group on 
q+1 letters. 

Notation. 

p is a prime; 5 is a power p*^ of p. 

G = PGL2(Fg), i.e., the quotient of GL2(Fq) by its center F*. 
k is an algebraically closed field of characteristic p; all the curves I consider are 
over k. 

Preliminary construction. I start from the obvious fact that G acts in a natural 
way (by "fractional linear transformations") on the projective line Pi. In algebraic 
terms this means that G acts on k{t) hy t {at + b)/{ct + d). The quotient curve 
P[ = Pi/G is of course (Liiroth's theorem!) a projective line. Equivalently, the 
field of invariants of G in k{t) is a purely transcendental field k{x). 

The first computational problem which arises is to write x explicitly. To do so, 
let us call (u, v) the homogeneous coordinates on Pi, so that t = v/u. The invariant 
theory of k[u, v] with respect to the action of G has been done long ago by Dickson. 
The basic covariants are the following homogeneous polynomials: 

2 2 

A{u,v) = uv'' — vu"^, B{u,v) = {uv'' — vu'^ )/A{u,v). 



*The author expresses his appreciation to J. P. Serre for permission to include the following 
letter in this paper. 
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They are of degree q + 1 and — q respectively. Hence the ratio 

x = B{u,vy+^/A{u,vy^-'^ 

is invariant by G. Its expression in terms of t is easy to find: if we write A{u, v) = 
u^+'^a{t), B{u,v) = ui^-ib{t), we have 

a{t) =t'3-t, b{t) = - -t)= a{ty-'^ + 1 

and 

(*) X = b{ty+^/aity'-^ = ia{ty-' + l)''+Va(t)«'-«. 

This shows that x is a rational function of t of degree q{q^ — 1). Since it is invariant 
by G, which has order g(g^ — 1), Galois theory shows it generates the field of the 
G-invariant elements of k{t). Hence we have found our parameter for P'^ = Pi/G. 

Ramification. It is necessary to study the ramification in the Galois extension 

k{f,)/k{x) thus constructed. This amomits to looking for the fixed points of the 
action of G on the projective line Pi. It is easy to see that these fixed points make 
up two orbits. Namely: 

(a) The Fq-rational points of Pi. This orbit has q + 1 elements. The stabilizer 
of an element is a triangular subgroup ("Borel subgroup") of order q{q — l). Since 
that order is divisible by p, there is wild ramification. 

The point of P^ corresponding to this orbit is a; = oo. 

(b) The "quadratic" points, i.e., the Fq2 -rational points of Pi which are not 
rational over Fg. There are q^ —qoi them. The stabilizer of such a point is a cyclic 
group of order q+l ("nonsplit Cartan subgroup"). Since that order is prime to p, 
the ramification at such a point is tame. 

The point of P'l corresponding to this orbit is a; = 0. 

Hence we see that the covering of P^ we get in this way is ramified both at and 
oo, and nowhere else. The next step is thus: 

Getting rid of the ramification at using Abhyankar's lemma. We consider 

the cyclic extension k{X) of k{x) defined by the equation X''^^ = x. By making a 
base change to that extension (i.e., by considering k(t^ X)/k{X)) we get rid of the 
ramification at 0. Only the ramification at oo remains. Of course, one has to see 
what the new Galois group is. There are two cases: 

(i) p = 2. The extensions k{X)/k{x) and k{t)/k{x) are disjoint. Hence the new 
Galois group is equal to the old one, namely G = PGL2(Fg), which happens to be 
equal to PSLaCF,). 

(ii) p ^2. The extensions k{X)/k{x) and k(t)/k(x) have a quadratic extension 
in common, namely k{x^/'^). Hence the new Galois group is G' = PSL2(Fq). 

In both cases, one thus gets a Galois extension of k{X) with Galois group 
PSL2(Fg) which is ramified only at X = oo. 

It remains to see that this extension is the same as the one you get by the 
equation -XY+1 = 0. 
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An equation for the degrees q + 1 extension. I go back to the k{t)/k{x) 
extension with Galois group G. Let H be the triangular subgroup of G, of index 
q + 1. The fixed field k{t)^ is an extension of degree q + 1 oi k{x) and we want 
to find a generator y for that field (which is also a purely transcendental field, of 
course). We may assume that H is the group of transformations t at + b. This 
shows that the polynomial 

y = a{ty-' = (t« - ty-' 

is invariant by H. Since its degree is q{q — 1) = the same argument as above 
shows that k{y) is equal to k{t)^ . 

I now write the equation of degree g + 1 relating y to x. This is easy, since by 
construction, we have x = {y -\- cf. (*) above. We thus get the equation 

{y + 1)9+1 _ ^^yq ^ g_ 

But I want to work on k{X), with x = X'^^^ . We have: 

{y + 1)9+1 _ xl+^yq ^ Q 

Let me put Y = {y + l)/yX, i.e., y = 1/{XY — 1). The above equation becomes 

y9+i -XY+1=0, 

and wc arc done. 

This is the proof I found in 1988 when I started thinking about your problem. 
The first part is natural enough — and could indeed be applied to other groups, a la 
Nori. The second part (the search for the degree q+l equation) is not; it looks like 
a happy coincidence, and I would not have found it if I had not known in advance 
your polynomial — XY + 1. 

With best regards, 
Yours 

J-P. Serre 

PS The determination of the invariants of G in k{t) is not new. I am almost certain 
to have seen it in print very long ago, as an elementary exercise in Galois theory. 
(Indeed: see Lang's Algebra, 2nd ed., p. 349, exercise 33, and also P. Rivoire, Ann. 
Inst. Fourier 6 (1955-1956), pp. 121-124.) 

(Shreeram S. Abhyankar) MATHEMATICS Department, Purdue University, West Lafayette, 
Indiana 47907 



